The files included this data release are the raw and processed deoxyribonucleic acid (DNA) sequence files referenced in the journal article (cited below) by Lawler and others (2016) titled "Coral-Associated Bacterial Diversity is Conserved across Two Deep-Sea Anthothela Species." They represent a 16S rRNA gene amplicon survey of the coral's (Anthothela spp.) microbiome, completed using Roche 454 pyrosequencing with titanium reagents. The samples used in this study were collected from cold-water corals between 2012-2013, from Baltimore and Norfolk Canyons, in the Atlantic Ocean and the samples were collected in 2012-2013. The raw data files associated with this study have also been submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) under Bioproject number PRJNA296835. For more information, you may contact Christina Kellogg at the USGS St. Petersburg Coastal and Marine Science Center, 600 4th Street South, St. Petersburg, Florida, USA, 33710; Telephone: (727) 502-8128; Email: ckellogg@usgs.gov Citation: Lawler, S.N., Kellogg, C.A., France, S.C., Clostio, R.W., Brooke, S.D., and Ross, S.W., 2016, Coral-associated bacterial diversity is conserved across two deep-sea Anthothela species: Frontiers in Microbiology, v., 7, art. 458, https://doi.org/10.3389/fmicb.2016.00458 The file labeled "raw_data.zip" contains the individual sample SFF sequence files. The file labeled "processed_data.zip" contains a text document that details the scripts run in the bioinformatic package QIIME (Caporaso and others, 2010, Nature Methods 7:335-336, doi:10.1038/nmeth.f.303), default or chosen settings used for each script, and the input/output files associated with each script: anthothela_workflow.txt. The second file labeled "anthothela_qiime" includes all of the directories, input, and output files that are described in the workflow file. The samples were sequenced on two different pyrosequencing runs, so there are four SFF files (two for each run). To perform any initial splitting of the samples from these runs (which also contain other sequences from different studies), it is necessary to have a mapping file that corresponds to each run. The file titled "Plate_3008_map.txt" is the mapping file that goes with sequencing files ID2UZ7K01.sff and ID2UZ7K02.sff from the first run. The file titled "Plate_3009_map.txt" is the mapping file that goes with sequencing files ID4F0Y301.sff and ID4F0Y302.sff from the second run. A comprehensive mapping file labeled "Anthothela_map.txt" is also included combining all samples and environmental data for easy comparison. Both data zip files also contain the associated FGDC metadata, Sequence Read Archive (SRA) and MIMARKS compliant metadata (see below for details). The files labeled "Anthothela_BaltimoreNorfolk_SRA.metadata.zip" and "anthothela_metadata.zip" are SRA and MIMARKS (minimum information about a marker gene) compliant metadata, based on standards developed by the Genomic Standards Consortium for reporting marker gene sequences (Yilmaz and others, 2011, Nature Biotechnology 29:415-420, doi:10.1038/nbt.1823). The column headers are defined as follows: project_name: a description of the project's contents sample_name: a unique identifier for each sample bioproject_id: a unique accession number under which the raw data files have been submitted to the NCBI Sequence Read Archive (SRA) sample_title: a more detailed version of sample_name organism: what is being sequenced; in this case, a coral metagenome. Metagenome is defined as "a collection of genetic material (genomes) from a mixed community of organisms." host: organism with which the microbial community being sequenced is associated collection_date: the date upon which the samples were collected; format is year geo_loc_name: geographic location name for where the samples were collected lat_lon: the latitude and longitude where the samples were collected; units are decimal degrees samp_collect_device: sample collection device that was used to obtain these samples samp_mat_process: sample material process; specifically, how were the samples preserved in the field for later study source_material_id: identification of the source material that the sequenced samples were derived from (more specific than host; e.g., what part of the host) samp_size: sample size; what amount of sample was used to extract DNA for sequencing nucl_acid_ext: nucleic acid extraction; doi is provided for the paper which describes the specific extraction method employed for these samples nucl_acid_amp: nucleic acid amplification; doi is provided for the paper which describes the specific primers and amplification methods employed for these samples lib_size: library size; how many sequences were present in each sample at the beginning of analysis target_gene: which marker gene was targeted by the chosen primers; 16S ribosomal RNA Gene target_subfragment: what portion of the marker gene was targeted by the chosen primers; variable regions V4-V5 of the 16S rRNA Gene pcr_primers: sequences of the forward and reverse primers used in polymerase chain reaction to amplify the DNA mid: multiplex identifiers; arbitrary short DNA sequence used to identify the sequences from this particular sample out of the pool of samples included on a multiplexed run seq_meth: sequence method; what sequencing technology was used to produce these data depth: water depth at which coral samples were collected; units are meters temp: water temperature at which coral samples were collected; units in degrees Celsius samp_store_temp: sample storage temperature between field collection and laboratory processing; -20 degrees Celsius salinity: salinity at which coral samples were collected; psu = practical salinity units contact_name: contact person for data contact_email: email of contact person for data