Germ_lines_df is a data frame of germline sequence information for multiple species. It contains 7,285 observations and 8 variables: nucleotide sequence, gene name(s), species, chain type, accession number, official species name, source, and strain. All values are preserved exactly as in the original source.
Usage
data(Germ_lines_df)
Format
A data frame with 7,285 observations and 8 variables:
- sequence
Germline nucleotide sequence (factor)
- names
Gene name(s) (character string)
- species
Species code or name (character string)
- chain
Chain type (factor)
- accession
Accession number (factor)
- species_name_official
Official species name (character string)
- source
Source of the germline sequence (character string)
- strain
Strain or line information (character string)
Details
The dataset is named 'Germ_lines_df' to avoid confusion with other datasets in the R ecosystem. The name identifies it as part of the BioWorldR package, and the suffix 'df' indicates that it is a data frame.
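The column types listed under Format affect how the data can be used: 'sequence', 'chain', and 'accession' are factors, so they usually need converting to character before string operations. The following is a minimal sketch of that step, not an example shipped with the package. It uses a small toy data frame with the same eight columns so that it runs without BioWorldR installed; every value in it is invented for illustration.

```r
# Toy stand-in with the same 8 columns and the documented column types.
# All values are hypothetical and are NOT taken from Germ_lines_df.
toy <- data.frame(
  sequence = factor(c("CAGGTGCAGCTG", "GAGGTGCAGC", "CAGGTGCAGCTG")),
  names = c("geneA", "geneB", "geneC"),
  species = c("sp1", "sp1", "sp2"),
  chain = factor(c("H", "L", "H")),
  accession = factor(c("ACC001", "ACC002", "ACC003")),
  species_name_official = c("Species one", "Species one", "Species two"),
  source = c("srcA", "srcA", "srcB"),
  strain = c("s1", "s2", NA),
  stringsAsFactors = FALSE
)

# With the package installed, the real data could replace the toy frame:
# data(Germ_lines_df, package = "BioWorldR"); toy <- Germ_lines_df

# Factors store integer codes, so convert to character before string operations.
seqs <- as.character(toy$sequence)
seq_length <- nchar(seqs)

# Cross-tabulate entries by species and chain type.
counts <- table(toy$species, toy$chain)
```

Converting once into a separate character vector leaves the data frame itself unchanged, which is consistent with keeping the values as provided.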