Description
We are in the process of curating Tahoe 100m and a number of public datasets which will require support for cell lines.
In addition, Idan, Jenny, and I have consolidated around a schema proposal.
Proposal (@gabdank)
- Field Rename: tissue_type → sample_type
Change the primary field name from tissue_type to sample_type to better reflect the broader range of biological sample sources being curated
Idan created GitHub issue #1365
- Enhanced Sample Type Categories (formerly known as tissue_type):
Addition of cell line as a new sample_type category
Rename cell culture → primary cell culture to distinguish from immortalized cell lines
Retain existing categories: tissue, organoid
- Add in_vitro_cell_ontology_term_id:
Required for primary cell culture: Must use CL (Cell Ontology) terms
Required for cell line: Must use Cellosaurus identifiers
NA for tissue and organoid types
- Update the logic for tissue_ontology_term_id:
Required when sample_type is tissue or organoid and is UBERON,
NA for cell line and primary cell culture types
An alternative proposal that would save us from adding new property.
- Field Rename: tissue_type → sample_type
Change the primary field name from tissue_type to sample_type to better reflect the broader range of biological sample sources being curated
Idan created GitHub issue #1365
- Field rename tissue_ontology_term_id -> sample_ontology_term_id
Required when sample_type is tissue or organoid and Must use UBERON terms,
Required for primary cell culture: Must use CL (Cell Ontology) terms
Required for cell line: Must use Cellosaurus identifiers
- Enhanced Sample Type Categories (formerly known as tissue_type):
Addition of cell line as a new sample_type category
Rename cell culture → primary cell culture to distinguish from immortalized cell lines
Retain existing categories: tissue, organoid