attune/work-summary/sessions/2024-01-20-pack-testing-framework-phase1.md

# Pack Testing Framework - Phase 1 Implementation

**Date**: 2024-01-20
**Status**: 🔄 IN PROGRESS (Phase 1: 75% Complete)
**Component**: Pack Testing Framework
**Phase**: Phase 1 - Database Schema & Models

---

## Objective

Implement Phase 1 of the Pack Testing Framework to enable programmatic test execution during pack installation. This phase focuses on the foundational database layer and data models.

---

## Phase 1 Goals

### ✅ Completed (75%)

1. **Database Schema** ✅
   - Migration file created and applied
   - Tables, views, and functions implemented
   - Constraints and indexes in place

2. **Data Models** ✅
   - All models defined in common library
   - Serialization/deserialization configured
   - Type safety ensured

3. **Repository Layer** ✅
   - Full CRUD operations implemented
   - Query methods for test results
   - Statistics and filtering functions

4. **Design Documentation** ✅
   - Complete specification documented
   - Architecture defined
   - Integration points identified

### ⏳ Remaining (25%)

5. **Worker Test Executor** ⏳
   - Test discovery from pack.yaml
   - Runtime-aware test execution
   - Output parsing

6. **Result Storage** ⏳
   - Integration with repository
   - Error handling
   - Logging

---

## Completed Work

### 1. Database Migration

**File**: `migrations/20260120200000_add_pack_test_results.sql` (154 lines)

**Tables Created:**
- `attune.pack_test_execution` - Main table for test execution tracking
  - Stores complete test results as JSONB
  - Tracks pass/fail counts, pass rate, duration
  - Links to pack with CASCADE delete
  - Supports trigger reasons: install, update, manual, validation

**Views Created:**
- `attune.pack_test_summary` - All test executions with pack details
- `attune.pack_latest_test` - Latest test result per pack

**Functions Created:**
- `get_pack_test_stats(pack_id)` - Statistical summary
- `pack_has_passing_tests(pack_id, hours_ago)` - Recent test validation
- `update_pack_test_metadata()` - Trigger function for metadata updates

**Indexes:**
- `idx_pack_test_execution_pack_id` - Fast pack lookup
- `idx_pack_test_execution_time` - Time-based queries
- `idx_pack_test_execution_pass_rate` - Filter by success rate
- `idx_pack_test_execution_trigger` - Filter by trigger reason

**Constraints:**
- Valid test counts (non-negative)
- Valid pass rate (0.0 to 1.0)
- Valid trigger reasons (install, update, manual, validation)

### 2. Data Models

**File**: `crates/common/src/models.rs`

**Models Added:**

```rust
// Database record
PackTestExecution {
    id, pack_id, pack_version, execution_time,
    trigger_reason, total_tests, passed, failed,
    skipped, pass_rate, duration_ms, result, created
}

// Test execution structure
PackTestResult {
    pack_ref, pack_version, execution_time,
    total_tests, passed, failed, skipped,
    pass_rate, duration_ms, test_suites[]
}

// Test suite (per runner type)
TestSuiteResult {
    name, runner_type, total, passed, failed,
    skipped, duration_ms, test_cases[]
}

// Individual test case
TestCaseResult {
    name, status, duration_ms,
    error_message, stdout, stderr
}

// Test status enum
TestStatus: Passed | Failed | Skipped | Error

// View models
PackTestSummary - Summary with pack details
PackLatestTest - Latest test per pack
PackTestStats - Statistical aggregation
```

### 3. Repository Layer

**File**: `crates/common/src/repositories/pack_test.rs` (410 lines)

**Methods Implemented:**

**Creation:**
- `create(pack_id, pack_version, trigger_reason, result)` - Record test execution

**Retrieval:**
- `find_by_id(id)` - Get specific test execution
- `list_by_pack(pack_id, limit, offset)` - Paginated pack tests
- `get_latest_by_pack(pack_id)` - Most recent test
- `get_all_latest()` - Latest test for all packs

**Statistics:**
- `get_stats(pack_id)` - Full statistical summary
  - Total executions, successful, failed
  - Average pass rate and duration
  - Last test time and status
- `has_passing_tests(pack_id, hours_ago)` - Recent validation

**Filtering:**
- `list_by_trigger_reason(reason, limit, offset)` - Filter by trigger
- `get_failed_by_pack(pack_id, limit)` - Failed executions only
- `count_by_pack(pack_id)` - Count total tests

**Maintenance:**
- `delete_old_executions(days_old)` - Cleanup old test data

**Tests:**
- Unit tests for all major operations (3 tests implemented)
- Tests marked as `#[ignore]` (require database)
- Coverage for creation, retrieval, statistics

### 4. Design Documentation

**File**: `docs/pack-testing-framework.md` (831 lines)

**Sections:**
1. **Overview** - Purpose and design principles
2. **Pack Manifest Extension** - pack.yaml testing configuration
3. **Test Discovery Methods** - Directory, manifest, executable
4. **Test Execution Workflow** - Installation flow and test process
5. **Test Result Format** - Standardized JSON structure
6. **Database Schema** - Complete schema documentation
7. **Worker Service Integration** - Test executor design
8. **CLI Commands** - Command specifications
9. **Test Result Parsers** - JUnit XML, TAP, simple formats
10. **Pack Installation Integration** - Modified workflow
11. **API Endpoints** - REST API design
12. **Best Practices** - Guidelines for pack authors
13. **Implementation Phases** - Phased rollout plan

**Key Features Designed:**
- Runtime-aware testing (shell, Python, Node.js)
- Fail-fast installation (tests must pass)
- Standardized result format across all runner types
- Database tracking for audit trail
- Configurable failure handling (block, warn, ignore)

### 5. Pack Configuration

**File**: `packs/core/pack.yaml`

**Added Testing Section:**
```yaml
testing:
  enabled: true
  discovery:
    method: "directory"
    path: "tests"
  runners:
    shell:
      type: "script"
      entry_point: "tests/run_tests.sh"
      timeout: 60
      result_format: "simple"
    python:
      type: "unittest"
      entry_point: "tests/test_actions.py"
      timeout: 120
      result_format: "simple"
  result_path: "tests/results/"
  min_pass_rate: 1.0
  on_failure: "block"
```

**Benefits:**
- Core pack now discoverable for testing
- Configuration demonstrates best practices
- Ready for automatic test execution

---

## Technical Implementation Details

### Database Design Decisions

1. **JSONB for Full Results**: Store complete test output for debugging
2. **Separate Summary Fields**: Fast queries without JSON parsing
3. **Views for Common Queries**: `pack_latest_test` for quick access
4. **Functions for Statistics**: Reusable logic in database
5. **Trigger Reasons**: Track why tests were run (install vs manual)
6. **CASCADE Delete**: Clean up test results when pack deleted

### Model Architecture

1. **Separation of Concerns**:
   - `PackTestExecution` - Database persistence
   - `PackTestResult` - Runtime test execution
   - Conversion handled in repository layer

2. **Type Safety**:
   - Strongly typed enums for test status
   - SQLx `FromRow` for database mapping
   - Serde for JSON serialization

3. **Extensibility**:
   - JSONB allows schema evolution
   - Test suites support multiple runner types
   - Test cases capture stdout/stderr for debugging

### Repository Patterns

1. **Consistent API**: All repositories follow same patterns
2. **Error Handling**: Uses common `Result<T>` type
3. **Async/Await**: All operations are async
4. **Connection Pooling**: Reuses PgPool efficiently
5. **Testing**: Unit tests with `#[ignore]` for database dependency

---

## Integration Points

### Current Integration

✅ **Common Library**:
- Models exported from `models::pack_test`
- Repository exported from `repositories::PackTestRepository`
- Available to all services

✅ **Database**:
- Schema applied to database
- Views and functions available
- Ready for queries

### Planned Integration (Phase 2)

⏳ **Worker Service**:
- Test executor will use repository to store results
- Runtime manager will execute tests
- Output parsers will populate `PackTestResult`

⏳ **CLI Tool**:
- `attune pack test` command
- Integration with pack install
- Display test results

⏳ **API Service**:
- Endpoints for test results
- Query test history
- Trigger manual tests

---

## Files Created/Modified

### Created Files (6)
1. `migrations/20260120200000_add_pack_test_results.sql` (154 lines)
2. `crates/common/src/repositories/pack_test.rs` (410 lines)
3. `docs/pack-testing-framework.md` (831 lines)
4. `work-summary/2024-01-20-core-pack-unit-tests.md` (Updated with framework info)
5. `work-summary/2024-01-20-pack-testing-framework-phase1.md` (This file)

### Modified Files (4)
1. `crates/common/src/models.rs` - Added pack_test module (130+ lines)
2. `crates/common/src/repositories/mod.rs` - Exported PackTestRepository
3. `packs/core/pack.yaml` - Added testing configuration
4. `CHANGELOG.md` - Documented Phase 1 progress
5. `work-summary/TODO.md` - Added Phase 1 tasks

---

## Verification

### Compilation Status
```bash
cd crates/common && cargo check
# Result: ✅ Finished successfully
```

### Database Migration
```bash
psql $DATABASE_URL < migrations/20260120200000_add_pack_test_results.sql
# Result: ✅ All objects created successfully
```

### Database Verification
```sql
\d attune.pack_test_execution
# Result: ✅ Table exists with all columns, indexes, constraints

SELECT * FROM attune.pack_latest_test;
# Result: ✅ View accessible (empty initially)
```

---

## Next Steps (Phase 2)

### 1. Worker Test Executor Implementation

**File**: `crates/worker/src/test_executor.rs`

**Requirements**:
- Parse pack.yaml testing configuration
- Discover test runners by type
- Execute tests with timeout protection
- Parse output (simple format initially)
- Create `PackTestResult` structure
- Store results via `PackTestRepository`

**Pseudocode**:
```rust
struct TestExecutor {
    runtime_manager: Arc<RuntimeManager>,
    test_repo: PackTestRepository,
}

impl TestExecutor {
    async fn execute_pack_tests(
        pack_dir: &Path,
        pack_id: i64,
    ) -> Result<PackTestResult> {
        // 1. Load pack.yaml
        // 2. Parse testing config
        // 3. For each runner type:
        //    - Get runtime
        //    - Execute test script
        //    - Parse output
        //    - Collect results
        // 4. Aggregate results
        // 5. Store in database
        // 6. Return result
    }
}
```

### 2. Simple Output Parser

**File**: `crates/worker/src/test_parsers/simple.rs`

**Requirements**:
- Parse our bash test runner output format
- Extract: "Total Tests:", "Passed:", "Failed:"
- Create `TestSuiteResult` from parsed data
- Handle errors gracefully

**Format**:
```
Total Tests:  36
Passed:       36
Failed:       0
✓ All tests passed!
```

### 3. CLI Integration

**File**: `crates/cli/src/commands/pack.rs`

**New Command**: `attune pack test <pack>`

**Requirements**:
- Load pack from filesystem or registry
- Call worker test executor
- Display results with colors
- Exit with appropriate code (0=pass, 1=fail)

**Options**:
- `--verbose` - Show detailed test output
- `--runtime <type>` - Test specific runtime only

### 4. Pack Install Integration

**File**: `crates/cli/src/commands/pack.rs`

**Modify**: `install_pack()` function

**Requirements**:
- Check if testing enabled in pack.yaml
- Run tests before activation
- Handle failure based on `on_failure` setting
- Store results in database
- Display test summary

**Options**:
- `--skip-tests` - Skip testing
- `--force` - Install even if tests fail

---

## Testing Strategy

### Unit Tests (Completed)
- ✅ Repository unit tests (3 tests)
- ✅ Model serialization tests (via serde)

### Integration Tests (Planned)
- [ ] End-to-end test execution
- [ ] Database result storage
- [ ] CLI command testing
- [ ] Pack install with tests

### Manual Testing (Planned)
```bash
# Test core pack
attune pack test core

# Test with verbose output
attune pack test core --verbose

# Install with testing
attune pack install ./packs/my_pack

# Install skipping tests
attune pack install ./packs/my_pack --skip-tests
```

---

## Performance Considerations

### Database
- Indexes on common query patterns (pack_id, time, pass_rate)
- JSONB for flexible storage without schema changes
- Views for fast common queries
- Function for statistics to avoid repeated logic

### Test Execution
- Timeouts prevent hanging tests
- Parallel execution possible (future enhancement)
- Test caching can speed up repeated runs (future)

### Cleanup
- `delete_old_executions()` for pruning old data
- CASCADE delete removes tests when pack deleted
- Configurable retention period

---

## Documentation Status

✅ **Design Document**: Complete and comprehensive
✅ **Database Schema**: Fully documented with comments
✅ **Code Documentation**: All functions documented
✅ **Integration Guide**: Best practices documented
✅ **API Specification**: Designed (implementation pending)

---

## Metrics

### Lines of Code
- Migration: 154 lines
- Models: 130+ lines
- Repository: 410 lines
- Design Doc: 831 lines
- **Total: ~1,525 lines**

### Test Coverage
- Repository tests: 3 tests
- Model tests: Implicit via serde
- Integration tests: Pending Phase 2

### Database Objects
- Tables: 1
- Views: 2
- Functions: 3
- Triggers: 1
- Indexes: 5
- Constraints: 4

---

## Conclusion

Phase 1 of the Pack Testing Framework is 75% complete with the database schema, models, and repository layer fully implemented. The foundation is solid and ready for Phase 2 implementation of the worker test executor and CLI integration.

**Key Achievements**:
- ✅ Complete database schema with audit trail
- ✅ Type-safe models with proper serialization
- ✅ Comprehensive repository with statistics
- ✅ Detailed design documentation
- ✅ Core pack configured for testing

**Immediate Next Steps**:
1. Implement worker test executor
2. Create simple output parser
3. Add CLI `pack test` command
4. Integrate with pack install workflow

**Timeline Estimate**:
- Phase 2 (Worker + CLI): 4-6 hours
- Phase 3 (Advanced Features): 6-8 hours
- Testing & Polish: 2-4 hours

---

**Last Updated**: 2024-01-20
**Phase**: 1 of 4 (Database & Models)
**Status**: 75% Complete - Ready for Phase 2