📊 Summary
Data Collection Flow
The foundation of our algorithm is high-quality, representative data from startup employees. This page explains how we collect, validate, and process that data.
✅ Eligibility Requirements
Tenure
Minimum 12 months at current company
Verification
Corporate email and/or LinkedIn verification
Participation
One vote set per employee per cycle
Minimum Tenure Requirement
Minimum Tenure Requirement
Only employees with ≥12 months tenure can submit bullishness ratings. This requirement serves two important purposes:
- Ensures participants have meaningful context about their company
- Aligns with typical 1-year vesting cliffs, so participants likely have equity at stake
Identity Verification Process
Identity Verification Process
We use a multi-layer verification approach:
- Corporate email verification with OAuth or confirmation code
- Optional LinkedIn profile confirmation
- Employment status check through company directory (when available)
- Periodic re-verification
Participation Limitations
Participation Limitations
To maintain data integrity:
- Each employee can submit only one set of votes per voting cycle
- Votes cannot be changed after submission within the same cycle
- Consistent voting patterns are monitored across cycles
🔄 Two Key Input Types
- Self-Bullishness Rating
- Peer Admiration
Sample Distribution of 85 Employee Bullishness Ratings
🔍 Data Validation
Completeness Check
Ensuring all required fields are filled properly
Range Verification
Confirming ratings fall within allowed bounds
Uniqueness Check
Preventing duplicate submissions or selections
Theme Alignment
Validating that companies match the investment theme of the village
⚖️ Weighting Employee Votes
Employee Weight Calculation
Where:
- years_beyond_minimum = max(0, tenure_years - 1)
- seniority_factor ranges from 0 (entry level) to 0.5 (executive)
Tenure Weighting
Tenure Weighting
Employees with longer tenure receive progressively higher weights:
- 1 year (minimum): Base weight
- 2 years: +0.2 weight
- 3 years: +0.4 weight
- 4 years: +0.6 weight
- 5+ years: +0.8 weight (maximum tenure bonus)
Seniority Weighting
Seniority Weighting
Position in the company provides additional weight:
- Entry level: No additional weight
- Team lead/manager: +0.1 weight
- Director/Senior Manager: +0.3 weight
- VP/Executive: +0.5 weight
Optional Historical Accuracy Factor
Optional Historical Accuracy Factor
In some implementations, we may incorporate historical accuracy:
- Based on how well an employee’s past bullishness ratings correlated with actual outcomes
- Requires multiple cycles of data
- Applied as a multiplier to the base weight
📅 Data Collection Timeframes
Our data collection follows a structured quarterly schedule:1
Notification Phase
All eligible participants receive notifications 3 days before the collection window opens
2
Collection Window
A 2-week period when all voting occurs, with reminders sent at the beginning, middle, and 2 days before closing
3
Processing Period
Following collection, data is validated, weighted, and prepared for the algorithm
4
Results & Rebalancing
Algorithm results are finalized and rebalancing recommendations are implemented
🔒 Privacy & Anonymity
Anonymized Results
Individual votes never attributed to specific people
Aggregated Data
Only aggregate scores used in the algorithm
Secure Storage
Encrypted storage with strict access controls
The privacy of individual voting data is critical to the integrity of the system. Without anonymity, employees might be reluctant to provide honest assessments, especially if they have concerns about their own company.
Minimum Data Thresholds
For a company to be included in the ranking calculation, we enforce minimum data requirements:Full Inclusion Criteria
- At least 3 eligible employees submitting bullishness ratings
- At least 2 mentions from other companies in admiration lists
Partial Inclusion Criteria
- 1-2 employees submitting bullishness ratings
- At least 1 mention from another company
- Flagged with limited confidence indicator
Below Threshold Handling
- May appear in village portfolios at reduced weights
- Marked as having limited data support
- Receive higher uncertainty scores in confidence intervals