Help:Quality control

From BugSigDB
Picture 1. Quality control tools.
Picture 2. The currently applied tags appear as "turned on" in the dropdown.
Picture 3. This link can refresh the displayed data immediately.

There are 2 primary methods of quality control on the BugSigDB data, in addition to the review system.

Study-level flags[edit]

Wiki editors can manually flag Studies as having quality issues. The list of tags is controlled by the Property:Quality control tag and includes (as of today):

  • Retracted paper
  • Contamination issues suspected
  • Batch effect issues suspected
  • Uncontrolled confounding suspected
  • Results are suspect (various reasons)


If a Study is flagged with at least one tag, the "Quality issues" label is displayed below the Review state label (see Picture 1).

Studies have the "Quality control" dropdown with the list of tags, that can be "turned on / off" with a click (see Picture 2). Tags for the current Study are stored along with the Study Review. Please, note that quality issues tags do not affect / depend on the Review state, and reflect possible issues with the Study itself, not with the way it was entered into the wiki.

For example, “Results are suspect (various reasons)” means that the wiki’s editors looked at the original study and decided the results look suspicious. Editors can enable/disable one or more tags. Clicking on the tag labels does not close the dropdown. In this way, editors can send multiple instructions to tag or untag: when all desired tags are set, editors must click the "Apply tags" button once to submit the tags to the database. Explanation as to why the tag was applied can and should be supplied on the page's "Discussion."

The "Apply tags" button is only available to editors. Readers will only see the status of the tags for the current study in a drop-down list.

It can take up to a minute for tags on the current page to update. For immediate control, consider using the "Refresh" link in the page tools dropdown (see Picture 3).

Taxon-level alerts[edit]

If the taxon itself or any of its parent taxa is found in the list, a warning icon with a tooltip is displayed next to the name of the taxon (see Picture 4).

The quality control for taxa implements the tests described below. Each taxon on the Signatures pages is checked against all the tests and marked with a specific icon if a match is found.

Potential contaminants (!)[edit]

Picture 4. Potential contaminant alert.

Individual taxa that should not be found in signatures are automatically marked as suspicious. Tests are run on page load, with each taxon being compared against a list of "potential contaminants."

Taxa are checked against the potential contaminants stored at the MediaWiki:ContaminationList page. This list was derived from the OpenContaminant Blacklist. The taxon is marked as a potential contaminant if the taxon ID or any of its parent taxa match one of the IDs listed at that page.

The list of potential contaminants on MediaWiki:ContaminationList can be modified by wiki administrators or members of the interface-admin group with the standard edit action here. The only requirement is to put one NCBI ID per line.

Not host-associated (X)[edit]

Taxa are checked against the curated list not host-associated NCBI Taxonomy IDs stored at the MediaWiki:ContaminationNotHostList page. The taxon is marked as a not host-associated taxaon if the taxon ID or any of its parent taxa match one of the IDs listed at that page.

Not host-associated taxa were obtained from screening through the NCBI taxonomy at phylum level, and tracing further down the taxonomy where a host-associated habitat could not be ruled out at phylum level. However, it should be noted that these classifications are not always clear-cut and may be modified by administrators over time as new findings are published.

The list of not host-associated taxa on MediaWiki:ContaminationNotHostList can be modified by wiki administrators or members of the interface-admin group with the standard edit action here. The only requirement is to put one NCBI ID per line.

Body site-typical (prevalent) (?)[edit]

Taxa are checked (regardless of the Experiment's associated body site) against taxon IDs stored at the Body site-typical microbiome signatures for adults page which indicate body-site typical taxa at a 50% prevalence level for healthy children and adults. Typical taxa are available for vagina, skin, feces, nasal cavity, milk, and oral cavity body sites. The taxon is marked as prevalent in a specific body site if the taxon ID or any of its parent taxa match one of the IDs listed at that page.

Data can be modified by changing the linked Signatures in the Body site-typical microbiome signatures for adults page.

See also[edit]