Data Engineering / Software Intern (12-Week Internship)
Overview
We are seeking a motivated and detail-oriented intern to design and build a data reprocessing and analysis tool to support testing and validation of our proprietary algorithms. This role is ideal for someone interested in data processing, analytics, and building user-facing tools.
Zeptive is a smart sensor company that is revolutionizing air monitoring technology. Our cutting-edge sensors ensure healthier and safer environments by providing real-time detection beyond what the eye can see. We develop innovative detectors that are highly accurate and power-efficient, such as patented battery-powered vape detectors, air quality monitors, and other safety sensing devices. Zeptive sensors are enhancing vigilance and protection globally in schools, workplaces, hotels, resorts, libraries, and other public spaces. Our commitment to excellence and innovation drives us to continually push the boundaries of what's possible in smart sensor technology and air quality monitoring, ensuring safety for everyone.
Project Objective
Develop a software tool that enables efficient reprocessing and evaluation of time-series data to measure algorithm performance, including classification accuracy and error rates.
Key Responsibilities
- Design and build a data reprocessing tool for time-series .csv datasets
- Implement logic to:
  - Load and parse structured .csv files with predefined headers and time intervals
  - Handle and clean imperfect data (e.g., ignore empty or malformed rows)
- Develop functionality to:
  - Reprocess datasets using different algorithm configurations
  - Classify results into:
    - True Positives
    - False Positives
    - True Negatives
    - False Negatives
- Create a user interface that allows users to:
  - Select and load data files
  - Group or annotate datasets (e.g., label as true/false cases)
  - Trigger reprocessing runs
- Generate outputs including:
  - Summary statistics
  - Performance metrics (percentages for each classification category)
  - Exportable results (e.g., CSV or report format)
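For illustration only, the reprocessing and scoring flow described above might be sketched in Python as follows. This is a minimal sketch under stated assumptions: the column names `timestamp` and `value`, the threshold-based `detect` placeholder, and the function names `load_rows`, `classify`, and `evaluate` are all hypothetical and do not reflect Zeptive's actual data format or proprietary algorithms.

```python
import csv
import io
from collections import Counter


def load_rows(stream):
    """Yield (timestamp, value) pairs, skipping empty or malformed rows.

    Assumes a header with hypothetical columns 'timestamp' and 'value'.
    """
    for row in csv.DictReader(stream):
        try:
            timestamp = row["timestamp"]
            value = float(row["value"])
        except (KeyError, TypeError, ValueError):
            continue  # missing column, short row, or non-numeric value
        if not timestamp:
            continue  # empty timestamp field
        yield timestamp, value


def detect(values, threshold):
    """Placeholder algorithm: report an event if any sample exceeds threshold."""
    return any(v > threshold for v in values)


def classify(expected, detected):
    """Map an annotated label and an algorithm result to TP, FP, TN, or FN."""
    if detected:
        return "TP" if expected else "FP"
    return "FN" if expected else "TN"


def evaluate(datasets, threshold):
    """Return the percentage of datasets in each classification category.

    datasets: iterable of (csv_text, expected) pairs, where expected is the
    user's annotation (True if the dataset contains a real event).
    """
    counts = Counter({"TP": 0, "FP": 0, "TN": 0, "FN": 0})
    for csv_text, expected in datasets:
        values = [v for _, v in load_rows(io.StringIO(csv_text))]
        counts[classify(expected, detect(values, threshold))] += 1
    total = sum(counts.values())
    return {k: (100.0 * n / total if total else 0.0) for k, n in counts.items()}


if __name__ == "__main__":
    sample = [
        ("timestamp,value\n0,0.1\n1,0.9\n", True),
        ("timestamp,value\n0,0.1\n\n1,bad\n2,0.2\n", False),
    ]
    # Re-run the same annotated data under two algorithm configurations.
    for threshold in (0.5, 0.95):
        print(threshold, evaluate(sample, threshold))
```

Calling `evaluate` again with a different `threshold` stands in for "reprocessing with a different algorithm configuration"; a real tool would substitute the algorithm under test for `detect` and would likely add export and UI layers on top.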
Expected Deliverables (By End of Internship)
- A functional data reprocessing application
- User interface for file selection, annotation, and analysis
- Robust data parsing and cleaning logic
- Accurate calculation and reporting of performance metrics
- Documentation covering:
  - System design
  - How to use the tool
  - Assumptions and limitations
Preferred Qualifications
- Pursuing a degree in Computer Science, Data Science, Electrical Engineering, or a related field
- Experience with:
  - Python, JavaScript, or similar programming languages
  - Data processing libraries (e.g., Pandas or equivalent)
  - Basic UI development (web or desktop frameworks)
- Familiarity with:
  - Time-series data
  - Data visualization or reporting tools
- Strong problem-solving skills and attention to detail
Learning Opportunities
- Hands-on experience building a real-world data processing tool
- Exposure to algorithm validation and performance analysis
- End-to-end development experience (data ingestion → processing → UI → reporting)
Duration
12 weeks (Summer Internship)