Technical Reference · AUG-RP-OPT-001

AI Well Optimization
Rod Pump Lift

Data requirements mapped to the Augmentify five-phase lifecycle — from dynagraph card audit through to real-time pump fillage optimization, gas interference detection, and predictive rod failure identification.

Beam Pump & PCP Pump Fillage Optimization Dynagraph Analysis Gas Interference Detection
Document Info
ReferenceAUG-RP-OPT-001
Revision1.0
TypeTechnical Ref
Lift TypeRod Pump
Phases01 – 05
Data Requirements Matrix — By Phase
Data Category PH.01Initiate PH.02Assess PH.03Select PH.04Define PH.05Execute
Production & Reservoir Data
Fluid Properties & PVT
Rod String & Pump Configuration
Surface Unit & Drive Data
Dynagraph & Load/Position Data
Real-Time SCADA & Motor Data
IPR & Well Test Data
Failure & Intervention History
Operational Constraints
Required at this phase
Partially required / preliminary
Production & Reservoir
Fluid & PVT
Equipment & Configuration
Dynagraph & SCADA
Operations & Constraints
By Phase

Rod Pump Optimization — Data Requirements

Rod pump wells are the most widely deployed artificial lift method in U.S. onshore operations and among the most data-rich — dynagraph (surface and downhole) card data carries a diagnostic signature for every failure mode from pump-off to rod parting to gas interference. The data requirements below reflect the unique value and unique complexity of dynagraph-based AI optimization.

Phase 01 Initiate Scoping the rod pump optimization opportunity

Establishes whether AI rod pump optimization is technically viable for the asset. At this stage, the key questions are the scope of instrumentation coverage (particularly dynagraph card collection), the frequency and severity of rod and pump failures, and the degree to which stroke/speed settings are currently being actively managed.

Production & Reservoir — Preliminary
  • Field-level and per-well oil, gas, water production rates — historical trend
  • Number of rod pump wells in scope and their operating status
  • Reservoir drive mechanism and depletion stage
  • Production decline profile and estimated deferral from pump-off events
  • Known high-GOR or high-viscosity wells — flags for gas interference or fluid pound risk
Rod Pump System — Preliminary
  • Beam pump unit inventory — API unit size, gear rating, counterbalance type
  • Downhole pump type (tubing vs. rod pump), size, and plunger clearance class
  • Rod string design — grade, size taper, rod count per well
  • VFD or multi-speed drive presence and operating range
  • Current stroke length and stroke speed (SPM) settings per well
  • Dynagraph card collection capability — surface load cell and position sensor status
Operations & Constraints — Preliminary
  • Current pump-off control (POC) system type and settings
  • Current optimization workflow — manual, POC-automated, or unmanaged
  • Surface facility capacity limits
  • IT/OT integration landscape and historian availability
Key Questions to Answer at Initiate
? Do wells have load cells and position sensors for dynagraph card capture?
? How frequently are dynagraph cards collected — continuous or periodic?
? What is the annual rod failure and pump repair rate across the asset?
? Are VFDs or multi-speed drives installed, enabling stroke speed optimization?
Phase Deliverables
Data Availability Assessment Optimization Opportunity Statement Rod Pump Well Inventory Dynagraph Coverage Assessment Project Charter
Phase 02 Assess Data audit, dynagraph quality & model feasibility

Full technical audit of the rod pump system and supporting data. This phase goes deeper than standard production data review — it audits the dynagraph card library, validates rod string designs, assesses pump fill efficiency history, and determines whether the historian and card data have the quality and volume to support an AI model for pump-off control, fillage optimization, and failure prediction.

Dynagraph card quality is the critical constraint: The dynagraph card is the primary diagnostic tool for every rod pump failure mode — fluid pound, pump-off, gas interference, rod part, and valve leak all have distinct card signatures. A model trained on cards with missing, mislabeled, or low-resolution data will misclassify failure modes. Card library quality must be assessed before any model architecture decision is made.
Production & Reservoir Data
  • Per-well oil, gas, and water production rates — daily and hourly where available
  • Wellhead pressure and flowing tubing head pressure (FTHP) history
  • Casing head pressure (CHP) history — gas cap and annular gas management
  • Flowing bottomhole pressure — measured or pump intake pressure derived
  • Static bottomhole pressure and reservoir pressure surveys
  • Well test history — frequency, method, last test date per well
  • Pump submergence and fluid level history (fluid level shots where available)
Fluid Properties & PVT
  • Oil API gravity and dead oil viscosity by well — viscosity critical for pump selection and rod loading
  • Solution GOR, producing GOR history, and bubble point pressure
  • Full PVT analysis — Bo, Rs, µo (viscosity vs. temperature curve for waxy/heavy oil wells)
  • Water cut history and produced water specific gravity
  • Free gas at pump intake — calculated from producing GOR and pump intake pressure
  • Paraffin and scale deposition history — impacts pump and rod operating envelope
Rod String & Pump Configuration
  • Rod string design — grade, diameter taper (from surface to pump), rod count per section
  • Rod string API design report — polished rod load, torque analysis, safety factors
  • Downhole pump — type, bore size, plunger clearance, barrel length, valve type
  • Pump setting depth (MD and TVD) and seating nipple specification
  • Tubing size, tubing anchor depth, and packer presence
  • Surface unit — API unit designation, stroke length, peak torque rating
  • Counterbalance type (rotary vs. conventional) and counterbalance effect (CBE) measurements
  • VFD or multi-speed drive — frequency range and current speed settings
Dynagraph & Card Data — Audit
  • Surface dynagraph card library — date, well, load (lbf) and position (in) data pairs
  • Card collection frequency — continuous, periodic, or event-triggered
  • Downhole dynagraph cards (wave equation derived) where available
  • Card labeling history — pump-off, full pump, fluid pound, gas interference, rod part, valve leak labels
  • Card resolution — data points per stroke cycle
  • Load cell and position sensor calibration records and known drift events
  • POC system type and pump-off detection method (card shape vs. peak torque limit)
Real-Time SCADA & Motor Data
  • Historian system type and tag inventory
  • Motor current and power consumption history per well
  • Stroke per minute (SPM) history and VFD frequency history
  • Polished rod torque and peak gear box torque trends
  • Data completeness — % tag uptime over last 3 years per well
  • Known sensor dropout events and calibration records
Failure & Intervention History
  • Rod failure log — date, failure mode, depth of failure, operating conditions, dynagraph card at failure
  • Pump repair/replacement log — failure mode found at surface: worn plunger, scored barrel, valve failure
  • Tubing leak history — associated with rod wear at couplings
  • Scale, paraffin, and corrosion event records per well
  • Workover history — rod string redesign, pump resizing, tubing changes
Data Quality Thresholds for Model Viability
Minimum 2 years of dynagraph card history per well — continuous collection preferred
>200 labeled cards per well for supervised pump condition classification
At least 15–20 documented rod failure events with pre-failure card history
>85% tag uptime on SPM, motor current, and load cell data
Sparse or unlabeled card libraries require a retrospective labeling program before model training
Phase Deliverables
Data Audit Report Dynagraph Card Library Assessment Rod String Design Register Data Quality Assessment Gap Analysis & Remediation Plan Model Feasibility Assessment Risk Register
Phase 03 Select Model architecture, data pipeline & optimization approach

Audit findings determine the optimization architecture. Rod pump AI optimization spans three distinct problem types — pump fillage optimization (stroke/speed tuning), pump condition classification (dynagraph card pattern recognition), and predictive rod failure detection — and the architecture must address each based on the data available. Model selection is also governed by whether VFDs are installed enabling real-time setpoint actioning.

Data Pipeline & Integration
  • POC / SCADA system API access and card data export format
  • Real-time dynagraph card streaming capability — card capture frequency and format (CSV, binary)
  • Historian tag access for SPM, torque, current, and production allocation data
  • Data lake target for card image and time-series storage
  • Latency requirements for pump-off control loop — POC setpoint write-back pathway
  • Cybersecurity and OT segmentation constraints
Training Dataset Definition
  • Feature set — card shape parameters (peak load, minimum load, area, shape index), SPM, fluid level, water cut
  • Label definition — pump condition class (full pump, pump-off, gas interference, fluid pound, valve leak, rod part warning)
  • Classification vs. regression split — condition classification and fillage prediction as separate models
  • Card preprocessing strategy — normalization to standard load/position envelope per well
  • Time-series features from SCADA for rod fatigue accumulation model
  • Cross-well transfer learning potential — pump and rod design homogeneity assessment
Rod & Pump Physics Inputs
  • Wave equation model inputs — rod string data, fluid properties, pump geometry (for Gibbs or API RP 11L-based simulation)
  • Predicted downhole dynagraph cards from wave equation for physics-informed hybrid model
  • Rod fatigue accumulation model — rod loading history and API modified Goodman diagram
  • Pump fillage efficiency curves — theoretical vs. actual at varying SPM and fluid level
  • Motor torque loading envelope — gear box torque limit and counterbalance optimization inputs
Constraint Data for Optimization
  • Minimum pump submergence — fluid level floor above pump intake
  • VFD operating range — min/max SPM limits and ramp rate constraints
  • Peak gear box torque limit — must not be exceeded at any stroke point
  • Surface facility capacity constraints per stream
  • Rod fatigue life budget — operating constraint to protect rod run life
Architecture Selection Drivers
VFD installed → real-time stroke speed optimization via AI setpoint recommendation
Large labeled card library → convolutional neural network (CNN) for card classification
Sparse labeled cards → wave equation physics-informed hybrid model
High rod failure rate → rod fatigue accumulation model as primary value driver
Phase Deliverables
Model Architecture Decision Card Data Pipeline Design Feature Engineering Specification Technology Stack Selection Vendor / Platform Assessment
Phase 04 Define Data preparation, model development & validation baseline

All data is cleaned, cards labeled, and models built and validated. Rod pump model validation must go beyond production accuracy metrics — the card classification model must be tested against known failure events, and the pump fillage model must be validated against observed pump-off events and fluid level shot data. Physics consistency is a required validation check.

Retrospective card labeling: If the existing card library lacks condition labels, a structured retrospective labeling exercise by an experienced rod pump engineer must precede model training. This requires 1–3 weeks depending on asset size. The card labeling protocol should define consensus rules for ambiguous borderline conditions (e.g., partial pump-off vs. gas interference). Model accuracy is only as good as the label quality.
Cleaned & Labeled Training Dataset
  • Cleaned dynagraph card dataset — normalized load/position, outliers removed, incomplete cards flagged
  • Condition-labeled card library — expert-reviewed labels with consensus protocol applied
  • Failure-correlated card sequences — cards in the 24–72 hours preceding rod failure events
  • SPM, torque, and motor current time-series aligned to card timestamps
  • Pump fillage efficiency labels computed from card area and wave equation model
  • Fluid level shot data integrated for pump submergence correlation
Validation & Benchmarking Data
  • Held-out card sequences from known failure events — withheld from training
  • Well test data for production rate prediction validation
  • Wave equation simulation outputs for hybrid model calibration
  • Historical SPM change events with documented production response
  • Operator log entries correlated with pump condition events
Reservoir & Fluid Updates
  • Latest PVT data — especially viscosity updates for heavy or waxy oil wells
  • Updated reservoir pressure and fluid level surveys
  • Revised IPR curves post any stimulation or workover
  • Updated water cut trends — affects pump fillage efficiency significantly
Acceptance Criteria Data
  • Baseline KPIs: pump-off frequency, average dynagraph fill percentage, rod failure rate, average pump run life
  • Card classification accuracy threshold agreed with operations (>90% on full pump / pump-off classes)
  • Rod failure recall threshold — number of warnings before failure is the primary metric
  • POC setpoint integration test — verified write-back to pump controller
Model Acceptance Gate — Data-Driven Criteria
Card classification accuracy >90% for full pump, pump-off, and fluid pound classes
Rod failure warning issued >24 hours before failure in >70% of held-out events
Production rate prediction MAPE <10% on validation wells
Optimization recommendations validated against >6 months historical SPM change events
Phase Deliverables
Labeled Card Library Trained Card Classification Model Rod Failure Prediction Module Fillage Optimization Model Model Performance Report Backtesting Results Deployment Specification
Phase 05 Execute Deployment, live optimization & continuous learning

The model deploys into the live production environment. Dynagraph cards are classified in real time as they arrive from the POC system, stroke speed recommendations are generated continuously, and rod fatigue accumulation is tracked per well. New failure events are labeled and fed back into the continuous learning loop. Data governance is critical — rod string redesigns or pump changes that go unlogged will silently degrade model accuracy.

Live Real-Time Inference Feeds
  • Streaming dynagraph cards — real-time load and position data from load cell and position sensor
  • Live SPM and VFD frequency data per well
  • Motor current and gear box torque trending
  • Live production rates — test-corrected allocation
  • Fluid level shots as available — manual or acoustic fluid level gun readings
  • Surface facility real-time constraints
Continuous Learning Data
  • Operator override log — when and why pump-off control or SPM recommendations were overridden
  • New failure events — rod failures, pump failures — labeled with pre-failure card sequences
  • Production response data post-SPM change (actioned recommendations)
  • Monthly well test updates to refresh production allocation
  • Model drift monitoring — card classification accuracy and fillage prediction error trending
System Change Tracking
  • Rod string redesign records — new grade, size taper, and API design report — must be updated within 48 hours
  • Pump replacement records — new bore size, plunger class, valve type
  • Surface unit changes — stroke length adjustments, counterbalance modifications
  • VFD range updates following unit changes
  • Workover outcomes — tubing changes, packer depth, completion updates
Performance & Value Tracking
  • Production uplift from pump fillage optimization vs. pre-deployment baseline
  • Rod failure events predicted and avoided vs. total events — predictive value tracking
  • Pump run life improvement vs. baseline
  • Reduction in unnecessary workover interventions triggered by false pump-off signals
  • Operator adoption rate and recommendation acceptance trending
  • Model retraining triggers — driven by changes in fluid properties, pump design, or failure mode distribution
Ongoing Data Governance Requirements
Monthly well tests mandatory to maintain production allocation accuracy
Load cell and position sensor calibration checks quarterly
All rod string and pump changes must be logged within 48 hours with full design details
Model retrain trigger: card classification accuracy below 88% for 2 consecutive weeks
Annual full card library review and model refresh assessment
Phase Deliverables
Deployed Rod Pump Optimization Model Live Card Classification Engine Rod Failure Detection Module Operator Dashboard Performance Monitoring Report Continuous Learning Framework Data Governance Protocol
Augmentify
220 N Green St, 2nd floor
Chicago, IL 60607, USA
USA: 833-INTERSOG INT: +1-833-468-3776
Legal
Privacy Policy Terms of Use Do Not Sell My Personal Information
© 2026 Intersog Inc. All rights reserved.
Augmentify™ is a trademark of Intersog Inc.
Scroll to Top