Bases: BaseProcessor
Processor for Johnson 2023 manually curated annotations.
Processes two datasets: - Microarray (GPL570): GEO samples with MESH-formatted disease and tissue - RNA-seq (refine.bio): SRA samples with DOID disease and free-text tissue
All annotations have expert curation (ecode='expert').
process(output_dir=PROCESSED_DIR, **kwargs)
¶
Process Johnson 2023 datasets into standardized annotations.
| Parameters: |
|
|---|
| Returns: |
|
|---|
validate(data)
¶
Validate that processed Johnson 2023 data meets requirements.
| Parameters: |
|
|---|
| Returns: |
|
|---|
| Raises: |
|
|---|