About GlossKit
GlossKit exists to support language documentation as structured, reviewable, and reusable research infrastructure.
Mission
GlossKit exists to support language documentation as structured, reviewable, and reusable research infrastructure. The aim is to make complex documentation workflows easier to coordinate without erasing the interpretive work at their center.
GlossKit grows out of Shuan Osman Karim's work in historical linguistics, documentary linguistics, morphology, and underdocumented Iranic languages. That background shapes the platform's emphasis on traceable primary data, reviewed analysis, and exportable research outputs.
Why GlossKit Exists
Many documentation projects depend on strong local practice but weak workflow continuity. Recordings, segment files, transcripts, glossing decisions, lexicon evidence, syntax work, and archival exports often live in separate systems with fragile handoffs between them.
GlossKit is being developed to reduce that fragmentation while respecting the reality that serious documentation work remains iterative and collaborative.
Platform Philosophy
GlossKit is designed to address recurring bottlenecks in documentation projects: duplicated metadata entry, weak provenance between stages, unclear status between editable and finalized analysis, limited reuse of reviewed morphology, brittle transitions from text analysis to lexicon and syntax, and export work that begins too late in the project lifecycle.
Modular, workflow-oriented design
Each stage should have enough structure to support downstream reuse, but not so much rigidity that it forces analysts into false certainty.
Relationship to established tools
GlossKit is framed as complementary to ELAN, FLEx, Praat, Universal Dependencies practices, and archive-specific workflows.
Human-in-the-loop analysis
ASR suggestions, token analyses, lexicon evidence, and syntax initialization are most useful when they accelerate careful work rather than replace it.
Transparent automation
The platform is intended to support transparent automation, evidence-bearing curation, and project-level coordination rather than black-box analysis.
Connected workflow coordination
GlossKit’s role is to provide a more connected web-based workflow layer that can coordinate recordings, reviewed text analysis, structured linguistic outputs, and export packaging.
Roadmap discipline
Near-term development is best framed around stronger segmentation validation, tighter ASR correction loops, improved syntax heuristics, richer export coverage, and clearer module hardening.
Roadmap Preview
Longer-term work may include phonology and grammar-writing modules built on the structured outputs of the earlier pipeline. GlossKit is intended to modernize and coordinate documentation workflows without flattening the judgment and collaboration that make those workflows meaningful.
Bring documentation workflows into one coordinated platform.
Request beta access to follow the platform as it develops or contact the GlossKit team to discuss a future use case.