[Sprint 10] feat: Create data-integrity-audit.md skill #373

Closed
opened 2026-02-02 06:20:55 +00:00 by lmiranda · 0 comments
Owner

Summary

Create the foundational skill that defines what "data valid" means for the data-platform domain. This skill is loaded by the data-advisor agent and provides audit rules, severity classification, scanning strategy, and report templates.

Wiki Reference: Sprint 10 Implementation

File to Create

plugins/data-platform/skills/data-integrity-audit.md

Required Sections

  1. Purpose - When and why to audit data integrity
  2. What to Check - Check categories table (Schema Validity, dbt Project Health, Lineage Integrity, Data Type Consistency, PostGIS Compliance, Query Safety)
  3. Common Violations - Detection patterns for each violation type
  4. Severity Classification - FAIL/WARN/INFO levels with gate behavior
  5. Scanning Strategy - How to scan dbt projects, PostgreSQL schemas, DataFrames, Python code
  6. Report Template - Gate mode (compact) and Review mode (detailed) templates

MCP Tools Referenced

PostgreSQL: pg_tables, pg_columns, pg_schemas
dbt: dbt_parse, dbt_compile, dbt_test, dbt_ls, dbt_lineage
PostGIS: st_tables, st_geometry_type, st_srid, st_extent
pandas: describe, head, list_data

Acceptance Criteria

  • Skill file created with proper frontmatter
  • All 6 check categories documented with MCP tools
  • Common violations table with detection patterns
  • Severity classification matches viz-platform pattern (FAIL blocks gate, WARN/INFO don't)
  • Gate mode and review mode report templates defined
  • Scanning strategy covers dbt, PostgreSQL, PostGIS, pandas, Python code
## Summary Create the foundational skill that defines what "data valid" means for the data-platform domain. This skill is loaded by the `data-advisor` agent and provides audit rules, severity classification, scanning strategy, and report templates. **Wiki Reference:** [Sprint 10 Implementation](https://gitea.hotserv.cloud/personal-projects/leo-claude-mktplace/wiki/Change-V5.7.0%3A-Data-Platform-Domain-Advisory-%28Sprint-10-Implementation%29) ## File to Create `plugins/data-platform/skills/data-integrity-audit.md` ## Required Sections 1. **Purpose** - When and why to audit data integrity 2. **What to Check** - Check categories table (Schema Validity, dbt Project Health, Lineage Integrity, Data Type Consistency, PostGIS Compliance, Query Safety) 3. **Common Violations** - Detection patterns for each violation type 4. **Severity Classification** - FAIL/WARN/INFO levels with gate behavior 5. **Scanning Strategy** - How to scan dbt projects, PostgreSQL schemas, DataFrames, Python code 6. **Report Template** - Gate mode (compact) and Review mode (detailed) templates ## MCP Tools Referenced PostgreSQL: `pg_tables`, `pg_columns`, `pg_schemas` dbt: `dbt_parse`, `dbt_compile`, `dbt_test`, `dbt_ls`, `dbt_lineage` PostGIS: `st_tables`, `st_geometry_type`, `st_srid`, `st_extent` pandas: `describe`, `head`, `list_data` ## Acceptance Criteria - [ ] Skill file created with proper frontmatter - [ ] All 6 check categories documented with MCP tools - [ ] Common violations table with detection patterns - [ ] Severity classification matches viz-platform pattern (FAIL blocks gate, WARN/INFO don't) - [ ] Gate mode and review mode report templates defined - [ ] Scanning strategy covers dbt, PostgreSQL, PostGIS, pandas, Python code
lmiranda added the Type/Feature label 2026-02-02 06:20:55 +00:00
lmiranda added this to the Sprint 10 - Domain Advisory Pattern (data-platform) milestone 2026-02-02 06:21:39 +00:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: personal-projects/leo-claude-mktplace#373