### **AEC/APC Symposium XXI**

# Improving Data Quality in Embedded Sensor Systems for APC

Julien Amelot, NIST

julien.amelot@nist.gov

Ya-Shian Li-Baboud, NIST

ya-shian.li-baboud@nist.gov



### Goals

To characterize data quality for factory systems and develop fundamental information technology and engineering methods to improve next-generation factory APC applications.



Knowing is half the battle





## Impact for APC

Data Producer

Data Quality

Accurate Knowledge

**Embedded** Equipment Sensor System **Timeliness** 

Completeness

Real-Time APC

Trustworthiness

### NGF cost & cycle time reduction:

- Deterministic data availability and analysis
- Timely, accurate knowledge enables rapid realtime advanced process control and minimizing/ eliminating false positives/negatives
- The faster the problems are detected and resolved, the less equipment use and time are wasted



## Data Quality Goals

### Time-Stamping

Within nanoseconds of event occurrence

#### **Data Sampling**

- Deterministic sampling
- No data loss
- 10 000 Hz and beyond

#### Understand embedded resources impact

- Design needs to meet data sampling requirements
- Limitations given embedded resources available
- Optimize data sampling

To experimentally evaluate tradeoffs in system design features to meet data quality objectives for next-generation APC applications.



### Overview of Sensor Network Testbed





### Sensor Network

#### Requirements

- Time stamping within sub microsecond accuracy
- Deterministic 200 kHz data sampling

#### **Results**

- Robot sound localization within accuracy of 10 cm
- Ability to handle 4 microphones (sensors) at 200 kHz sampling rate

### Relevance to APC data quality

- Requires time-stamping close to event occurrence
- High data sampling rates (APC: 100 Hz to 10 kHz)
- Requires minimal data loss for meaningful results (ideally no data loss)
- Minimize hardware cost





## Embedded Hardware Design

- Analog to Digital Converter
  - 200KHz
- FieldProgrammableGate Array
  - Spartan 3E
  - 100MHz
- CPU
  - Cirrus EP9302
  - ARM9
  - 200MHz





### Embedded sensor data flow





## Optimizing data sampling

#### **Process**

- Build a model of the system
- Characterize all critical parameters
- Identify the different constraints and bottleneck of the system
- Find the best set of parameters according to the needs

### Data quality metrics considered

- Data sampling rate
- Data loss
- Data time-stamping latency/jitter
- Data processing latency/jitter



# Ŋ

National Institute of Standards and Technology

## Model of the system (part of)



## M

### Impact of the size of the buffer (RAM)



# M

# Impact of the speed of the Bus







National Institute of Standards and Technology

Optimizing data sampling parameters High probability to Good set of **Theoretical** overflow the FPGA's parameters behavior buffer **Worst case** behavior actually measured Data throughput too low to transfer all the 50 data 0.2 0.6 0.8 0.4 x 10<sup>-3</sup> 14 Interrupt Period (s)









### **CPU** load effects







## Embedded data quality analysis

#### Kernel Interrupts

 Prioritizing data acquisition at the kernel level provides more reliable data processing rates

#### FPGA/FPGA Memory

- Alleviates requirements on CPU processing (e.g. interrupt frequency)
- Timely processing of contextual information pertaining to the data (e.g. time stamps, uncertainty, etc.)

#### CPU Load

- Using non-preemptable interrupts, the CPU load is less of an issue; however, the non-preemptable interrupt may sacrifice performance of other processes running on the same CPU
- Multiple non-preemptable processes running on the same CPU would adversely affect the performance of data sampling

#### Communication bus

 Data sampling rate is also limited by the size of the communication bus(es) between system components (e.g. FPGA and system processor).





### Lessons Learned

- Ideally, design of embedded sensor device for APC requires assessment of requirements and limitations:
  - Measurement accuracy
  - Time-stamping accuracy
  - Data acquisition sampling rate
  - Data size
  - Data loss tolerance
  - Processing capability of systems in the data flow
- In designing equipment sensor devices, use of FPGA offers several benefits:
  - Allow accurate and reliable time-stamping
  - Time-stamping close to data measurement from the sensors
  - Provides hardware reliability, with software reconfigurability to meet changing needs
- Given limited resources, system tradeoffs can accommodate more stringent data acquisition requirements. For example,
  - If memory resources are low, interrupts will need to be more frequent
  - If system processor must manage a high level of priority interrupts, the design should allow for the memory buffer to store a larger quantity of data messages:
    - More memory
    - Improve data compression



### Future Research

- Additional tests comparing resource tradeoffs
  - Kernel interrupts v. User space applications
  - FPGA buffer size
  - Multi-core architectures
- Develop and test compression techniques
  - To enable improved data quality under limited conditions
- Develop improved data filtering / signal processing on the FPGA
  - Potential to discard noisy data to reduce data processing volume for CPU
- Extend simulation to address NGF-specific industry requirements
  - Need to understand next-generation APC requirements in terms of data sampling metrics
  - Develop embedded design methods to address NGF needs
    - Multi-core design
    - Collaborate with University of Michigan on use of IEEE 1588 for time synchronization



### References & Questions

- SEMI E151 Guide for Understanding Data Quality
- ISMI Next-Generation Factory Industry Briefing, O. Rothe, July 2009 <a href="http://ismi.sematech.org/meetings/archives/ismi/8718/450Briefing.pdf">http://ismi.sematech.org/meetings/archives/ismi/8718/450Briefing.pdf</a>
- Data Quality and Time Synchronization, G. Crispieri, December 2006
  http://ismi.sematech.org/meetings/archives/emanufacturing/8069/04-DataQuality.pdf



Official contribution of the National Institute of Standards and Technology; not subject to copyright in the United States. Certain commercial equipment, instruments, or materials are identified in this paper to foster understanding. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose.