Data Engineering / Software Intern (12-Week Internship)

Overview

We are seeking a motivated and detail-oriented intern to design and build a data reprocessing and analysis tool to support testing and validation of our proprietary algorithms. This role is ideal for someone interested in data processing, analytics, and building user-facing tools.

Zeptive is a smart sensor company that is revolutionizing air monitoring technology. Our cutting-edge sensors ensure healthier and safer environments by providing real-time detection beyond what the eye can see. We develop innovative detectors that are highly accurate and power-efficient, such as patented battery-powered vape detectors, air quality monitors, and other safety sensing devices. Zeptive sensors are enhancing vigilance and protection globally in schools, workplaces, hotels, resorts, libraries, and other public spaces. Our commitment to excellence and innovation drives us to continually push the boundaries of what's possible in smart sensor technology and air quality monitoring, ensuring safety for everyone.

Project Objective

Develop a software tool that enables efficient reprocessing and evaluation of time-series data to measure algorithm performance, including classification accuracy and error rates.

Key Responsibilities

  • Design and build a data reprocessing tool for time-series .csv datasets
  • Implement logic to:
    • Load and parse structured .csv files with predefined headers and time intervals
    • Handle and clean imperfect data (e.g., ignore empty or malformed rows)
  • Develop functionality to:
    • Reprocess datasets using different algorithm configurations
    • Classify results into:
      • True Positives
      • False Positives
      • True Negatives
      • False Negatives
  • Create a user interface that allows users to:
    • Select and load data files
    • Group or annotate datasets (e.g., label as true/false cases)
    • Trigger reprocessing runs
  • Generate outputs including:
    • Summary statistics
    • Performance metrics (percentages for each classification category)
    • Exportable results (e.g., CSV or report format)
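
To give candidates a feel for the workflow described above, here is a minimal Python sketch of one reprocessing pass: parse each file while skipping empty or malformed rows, run a detector, compare its output with the dataset's annotation, and report the percentage in each classification category. Everything specific in it is an assumption made for illustration: the column names `timestamp` and `reading`, the simple threshold detector (a stand-in, since the actual proprietary algorithms are not described here), and all function names.

```python
import csv
import io
from collections import Counter


def load_readings(csv_text, value_field="reading"):
    """Parse CSV text and return numeric readings, skipping bad rows.

    The 'reading' column name is hypothetical; real files would use
    whatever predefined headers the datasets carry.
    """
    readings = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        raw = (row.get(value_field) or "").strip()
        try:
            readings.append(float(raw))
        except ValueError:
            continue  # empty or non-numeric value: ignore the row
    return readings


def detect(readings, threshold):
    """Stand-in detector (assumption): fires if any reading exceeds threshold."""
    return any(value > threshold for value in readings)


def classify(predicted, actual):
    """Map a (predicted, annotated) pair to a confusion-matrix category."""
    if predicted:
        return "TP" if actual else "FP"
    return "FN" if actual else "TN"


def evaluate(datasets, threshold):
    """Reprocess (csv_text, label) pairs and return percentages per category."""
    counts = Counter({"TP": 0, "FP": 0, "TN": 0, "FN": 0})
    for csv_text, label in datasets:
        predicted = detect(load_readings(csv_text), threshold)
        counts[classify(predicted, label)] += 1
    total = sum(counts.values())
    return {k: (100.0 * n / total if total else 0.0) for k, n in counts.items()}


if __name__ == "__main__":
    # Two tiny annotated datasets; the first contains a blank and a malformed row.
    sample = [
        ("timestamp,reading\n0,0.1\n1,0.9\n\n2,abc\n", True),
        ("timestamp,reading\n0,0.1\n1,0.2\n", False),
    ]
    # Different algorithm configurations are modeled here as different thresholds.
    for threshold in (0.5, 0.95):
        print(threshold, evaluate(sample, threshold))
```

In this sketch, a "configuration" is just the threshold value, so rerunning `evaluate` with a different threshold corresponds to a new reprocessing run. The actual tool would swap in the real algorithm and its parameters and add the user interface and export steps.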

Expected Deliverables (By End of Internship)

  • A functional data reprocessing application
  • User interface for file selection, annotation, and analysis
  • Robust data parsing and cleaning logic
  • Accurate calculation and reporting of performance metrics
  • Documentation covering:
    • System design
    • How to use the tool
    • Assumptions and limitations

Preferred Qualifications

  • Pursuing a degree in Computer Science, Data Science, Electrical Engineering, or a related field
  • Experience with:
    • Python, JavaScript, or similar programming languages
    • Data processing libraries (e.g., Pandas or equivalent)
    • Basic UI development (web or desktop frameworks)
  • Familiarity with:
    • Time-series data
    • Data visualization or reporting tools
  • Strong problem-solving skills and attention to detail

Learning Opportunities

  • Hands-on experience building a real-world data processing tool
  • Exposure to algorithm validation and performance analysis
  • End-to-end development experience (data ingestion → processing → UI → reporting)

Duration

12 weeks (Summer Internship)