Skip to contents

This dataset, Germ_lines_df, is a data frame containing germline sequence information for multiple species. The dataset includes 7,285 observations and 8 variables, covering the nucleotide sequence, gene names, species, chain type, accession number, official species name, source, and strain. All values are preserved as in the original source, with no modifications.

Usage

data(Germ_lines_df)

Format

A data frame with 7,285 observations and 8 variables:

sequence

Germline nucleotide sequence (factor)

names

Gene name(s) (character string)

species

Species code or name (character string)

chain

Chain type (factor)

accession

Accession number (factor)

species_name_official

Official species name (character string)

source

Source of the germline sequence (character string)

strain

Strain or line information (character string)

Source

Data taken from the VDJgermlines package version 0.1

Details

The dataset name has been kept as 'Germ_lines_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the BioWorldR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified.