Waypoint TelemetryWaypoint Telemetry
Technical ResearchComputer VisionAI/ML

VisionAI - Subject Noise Filter (SNF)

A breakthrough algorithm achieving 96.1% accuracy in vision-based trail counting through intelligent subject noise filtering.

Zachary Pope
Waypoint Telemetry
Table of Contents

Executive Summary

Waypoint Telemetry is a vision-based analytics technology company that solves complex telemetry problems for the outdoors. Our VisionAI system has been developed to analyze trail camera telemetry data at scale and across sites to produce high-accuracy, multimodal analytics for trail use.

This report summarizes a recent advancement in our VisionAI system, called Subject Noise Filtering (SNF), which has brought our overall detection error rate of human subjects from 30.10% in our initial pilot to just 3.88%.

96.1%
Accuracy Rate
3.88%
Error Rate (Down from 30.10%)
6
Test Environments

Our studies show that the SNF algorithm is particularly effective in high-use, crowded scenarios where groups, children, and multi-modal activity is occurring.

The Problem

Vision-based analysis offers a powerful upgrade over traditional IR sensors for trail counting. However, AI algorithms must meet and exceed the accuracy and reliability of traditional IR methods to be a viable replacement.

Subject Noise Challenge

This report demonstrates and explains the issue of subject-noise and the implementation of the subject-noise filter (SNF) to eliminate undesirable false positives associated with subject noise due to real-world occurrences such as loitering, playing, backtracking, and more.

Subject crossing measurement area - frame 1
Subject crossing measurement area - frame 2
Subject crossing measurement area - frame 3
Subject crossing measurement area - frame 4
Figures 1-4: A real-world sequence of subjects crossing the measurement area, demonstrating scenarios that cause false positive error rates for both IR and simple motion-triggered camera methods. The SNF algorithm solves this problem.

Solution

Outdoor recreation monitoring has been predominantly sourced from traditional IR counters, which detect objects crossing a line of sight across a path. This is similar to the detection you see in front of grocery stores that trigger automatic doors. Any obstruction across the IR sensor (subject to some algorithmic filtering) will trigger a "subject detection."

Traditional IR Sensor Limitations
  • Animal crossings causing false positives
  • Missed detections in groups
  • Over-detection due to loitering or staggering of subjects

Vision-based technology has been the focus of Waypoint Telemetry. In this study, we use off-the-shelf trail cams to detect movement of subjects, then apply vision-based AI analysis to process the imagery and detect people, objects, activities, and other activity attributes.

Multi-modal Object Detection

In modern software architecture today, off-the-shelf libraries and AI models exist to classify photos. These range from light-weight, pre-trained models up to frameworks that support custom training. The model we have leveraged is the YOLO family of image object classification, which produces an array of detected objects from an image and associated confidence scores.

YOLO object detection example showing detected objects with confidence scores
Figure 5: Example of how a standard YOLO model works, detecting objects with confidence scores. Pre-trained YOLO models are very effective, and additional fine-tuning produces even higher accuracy for trail use.

Identifying Unique Subjects

Reverse Embedding Technology

We use reverse embedding vision technology to assign an embedding code for each detected subject within a window of images, and use similarity scores to determine if the same individual is in the frame within a specific time-frame.

This allows us to detect "noisy appearances" such as staggering, loitering, or other duplicate appearances of individuals. Each individual is assigned a unique embedding sequence and unique ID. Subsequent detected subjects are then checked for uniqueness.

Individuals assigned unique embedding sequences and IDs
Figure 6: Each individual is assigned a unique embedding sequence and unique ID. Subsequent detected subjects are checked for uniqueness.

Scenes, Confidence Scores and Similarity Scores

In the example below, we have two consecutive images taken less than 2 minutes apart. Both images clearly have the same two individuals. Using the SNF, we can see that the first image has unique individuals labeled. The next image properly detects them as duplicates and disregards them in the count, as expected in a high-accuracy counting system.

First frame showing unique individuals labeled
Second frame showing duplicates detected and filtered
Figures 7-8: Two consecutive images less than 2 minutes apart. The SNF correctly identifies the same individuals and filters duplicates.

Dynamic Scene Adjuster

In our testing, we found that different scenes require different approaches to our multi-modal detection libraries and SNF filters. Our ability to differentiate between different activities and distinguish unique individuals decreases with poorer environments.

Sunny Day (High Intelligibility)
Sunny day scene with high visibility
Night Time (Low Intelligibility)
Night time scene with reduced visibility

Dynamic Adjustment

Our SNF feature dynamically adjusts to the detected environment, optimizing detection parameters based on lighting conditions, visibility, and scene complexity.

Noise vs Reappearances

One important discussion point for the SNF implementation is the difference between noise and reappearances. Often in trail environments, a subject will enter a trail system and leave at the same location. These reappearances typically happen 15-90 minutes apart, almost never within 5 minutes.

2-Minute Rolling Window

The intention of the SNF is NOT to address filtering out individuals who re-appear in a trail system, but to filter out noise of subjects triggering multiple detections.

If a subject is detected consecutively within a 2-minute window, that subject will only be counted once. The detection continues to be filtered until at least 2 minutes have passed without that person appearing.

Same person entering the park
Same person leaving the park 14.5 minutes later
Figure 9: Same person entering and leaving the park, 14.5 minutes apart. These are two distinct detections, not noise.

Privacy

Privacy-First Approach

Given the sensitive nature of capturing real-world recreational activity, privacy is a critical consideration for trail counting applications. This particular study was conducted entirely on a local computer with no cloud storage of images or examples.

Facial Blur

Implemented

All detected subjects go through our VisionAI pipeline which includes a final step of 'head blur' to ensure that identifiable facial imaging is blurred prior to saving results to any cloud or computer.

30-Day Retention Policy

Planned

For our Waypoint Cloud platform, we have a 30-day retention policy planned whereby all source images are only retained for 30 days and automatically deleted thereafter. This allows users to utilize auditability and traceability while minimizing privacy and security risks.

Cloud Security - SOC2

Planned

A standard security and privacy framework (SOC2) is planned for our Waypoint Cloud architecture. Today, we have tight row-level-security (RLS) limiting access to cloud resources to only members and organizations which the resources belong.

Methods

Real-world applications of counting technologies in outdoor recreation involve many variables. In this study, we looked at 6 scenarios to demonstrate the problem scope and solution efficacy, using real camera captures from a pilot study at Three Rivers Parks in Bloomington, MN.

Weather & Visibility

Bright sun, gray clouds, night time, rain, snow

Multi-modal Subjects

Skiing, hiking, running, biking, snowmobiling

Animal Integration

Animal detections producing false positives

Groups & Crowds

Multiple people moving together

Test Environments

Environment 1: Bright Sun, Busy Cross Country Ski Trail
TRPD Site 4

Environment 1: Bright Sun, Busy Cross Country Ski Trail

Children and Families

Environment 2: Bright Sun, Busy Cross Country Ski Trail
TRPD Site 4

Environment 2: Bright Sun, Busy Cross Country Ski Trail

Second Round with different crowds

Environment 3: Early Morning, Gray Clouds
TRPD Site 4

Environment 3: Early Morning, Gray Clouds

Snow fall, fast skiers

Environment 4: Full Day (Dawn, Day, Evening)
TRPD Site 5

Environment 4: Full Day (Dawn, Day, Evening)

Multimodal trail 1

Environment 5: Full Day (Dawn, Day, Evening)
TRPD Site 3

Environment 5: Full Day (Dawn, Day, Evening)

Multimodal trail 2

Environment 6: Night-time Skiing
TRPD Site 6

Environment 6: Night-time Skiing

Low visibility conditions

Candidate Evaluations

Baseline Truth

Going through each detected image sequence and counting by hand: total images, images with people, total unique individuals. This method is considered 'truth' for error rate calculation.

Does not account for false negatives (subjects where the trail cam did not trigger).

VisionAI Version 1

Simple AI pipeline with minimal customizations. Used YOLOv8 to detect people with a default confidence score cutoff of 0.25 for all environments with no additional filters.

Used in TRPD trail study from 2/16/2026-3/4/2026.

VisionAI Version 2Current

The advanced algorithm being evaluated with Subject Noise Filter, custom AI vision model for trail directionality, 1st stage multi-modal detection with dynamic confidence scoring.

  • Subject-noise-filter with unique subject embeddings
  • Dynamic similarity scores for near-consecutive frames
  • Custom AI vision model for trail directionality
  • General confidence threshold of 0.5

Results

The table below shows total unique individuals detected after filtering, not raw detection counts. This measures how well each approach identifies distinct people while filtering out noise from repeated detections, loitering, and other false positives.

Unique Subject Detection

Across all 6 test environments, error rates for identifying unique individuals were:

VisionAI v1 (C=0.5)30.10% error
VisionAI v2 + SNF3.88% error

The SNF dramatically improved accuracy in filtering noise while maintaining correct unique individual counts.

Noise Reduction

The real performance improvements come in noise from real-world environments:

VisionAI v130.10% error
VisionAI v2 + SNF3.88% error
87.1% improvement in noise reduction
Real-World Challenge Scenarios

These situations include animal crossings, groups walking, multimodal activities triggering multiple times (skis+person+poles or bike), staggering, loitering, children playing, etc. In these real-world cases:

Without SNF
30.10%
Error Rate
With SNF
3.88%
Error Rate
Test CaseBaselineVisionAI - v1 (C=0.25)VisionAI - v1 (C=0.5)VisionAI - v2
2-28-26 carrier9101010
early-morning-walkers-dark17201717
gray-fast16181614
night14141414
sunny-crowded15342415
sunny-crowded-2321625337
Totals103258134107
Error Rate150.49%30.10%3.88%

Conclusion

Vision-based trail activity monitoring is an exciting approach that unlocks several key benefits compared to traditional IR trail counting.

Advanced Noise Filters

Vision-based AI analysis allows for more advanced noise filters critical for real-world scenarios such as loitering, groups, human-animal groups like dog-walkers.

Multi-modal Detections

Activity detections enable better accuracy through human/object/animal filtering that traditional IR counters cannot do.

Modern Trail Cam Technology

Coupling off-the-shelf trail cam technology with proprietary AI systems and cloud-based software enables reliable, high-accuracy trail counting.

Final Result

By coupling modern off-the-shelf trail cam technology with proprietary AI systems and cloud-based software, vision-based technology can be reliably used for trail counting purposes with high accuracy. The introduction of SNF brought our error rate down to 3.88% across a wide variety of challenging environments.

Appendix

Complete Test Data
Test CaseTotal ImagesImages w/ PeopleTotal DetectionsActual SubjectsActual UniquesDetected PeopleDetected UniquesMissedBad Det.
2-28-26 carrier14101099101001
early-morning-walkers-dark2015171717201700
gray-fast1616161816181420
night1414141414141400
sunny-crowded1313212215341510
sunny-crowded-233315253321623710

Summary Statistics

Total Images
110
VisionAI v1 Total
258
150.49% error
VisionAI v2 Total
107
3.88% error
Baseline Truth
103