Whole Genome Sequencing and the food industry

Posted: 8 November 2016 | Greg Jones, Senior Research Officer, Microbiology, Campden BRI | No comments yet

Whole Genome Sequencing (WGS) has the potential to render other forms of microbiological identification obsolete. New Food takes a closer look…

Figure 1: A small portion of a genome assembled from raw sequence reads. Each of the small black bars represents one sequence approximately 500 bases long.

Whole Genome Sequencing (WGS) has the potential to render other forms of microbiological identification obsolete. It is more accurate than a serotype, more discriminatory than a pulsed-field gel electrophoresis assay and it can prove relationships between strains with higher resolution than ever before. This is the method which has been adopted by regulatory agencies such as Public Health England and the Food and Drug Administration in the USA. Food companies are starting to become more aware of this area and wish to have a constructive dialogue with government agencies, however there can be a lack of knowledge regarding the technology and the potential uses of it in an industrial setting.

genome

The technique has become more prevalent in the last five years due to rapid advances in sequencing technology that have led to dramatic falls in cost per sequence. It is now possible to sequence genomes on a routine basis for a few hundred pounds each. Regardless of the technique used, the generation of huge amounts of sequence data has become entirely unremarkable. When a genome is sequenced, the initial output is hundreds of thousands of short sequences. Each of these sequences is a few hundred bases long and represents a tiny fragment of the total genome. The next challenge is to assemble these reads by comparing them to each other and ordering them according to their overlapping ends. This process is analogous to reconstructing an ancient document from fragments of parchment.

Figure 1: A small portion of a genome assembled from raw sequence reads. Each of the small black bars represents one sequence approximately 500 bases long.

The workflow to assemble a genome is straightforward:

Obtain an isolate via culture-based methods.
Extract the DNA.
Prepare the DNA for sequencing (“Library Preparation”).
Run the sequencer.
Assemble the short raw reads into longer sequences using software.

The next step is to give it some meaning by comparing it to other sequences. The comparison is reliant on the number of other genomes against which your submitted sequence is compared. The analysis gets larger as more genomes are added in to the comparison. Comparing whole genomes against databases of other whole genomes is currently performed by the FDA in the USA using their ‘Genome Trakr’ service. This service relies on the vast storage and computing power available to them from the National Centre for Biotechnology Information (NCBI). This is a resource available to anybody, and a genome submitted for analysis will be placed into context via comparison against other sequences. The output is a phylogenetic tree similar to the one shown in Figure 2.

genome2

Figure 2: Output from Genome Trakr

In the example above, the genome labelled as an ‘Environment/Food’ sample is highlighted in red, and is shown to cluster very closely with a set of isolates designated as ‘Clinical’.

The necessary use of a public database for this analysis has led to concern from some in the food industry who fear that doing the right thing and submitting sequences will reflect badly on them in the event their sequence is shown to be related to an outbreak. Despite assurances from the FDA in a recent meeting, US industry representatives still had some concerns that samples submitted with accompanying descriptions of its source that could ultimately be traced back to the company of origin are so sensitive that in-house legal advice is to not submit at all. The mood in the UK is similar, with companies approaching this method with a degree of caution.

Is this caution warranted?

A similar tool for tracing outbreaks exists in the form of PulseNet, based on DNA fingerprinting technology. What is new for WGS is the finer level of discrimination. As this information is available to anyone, the submitting company will be alerted to any clinical link at the same time as the regulator, allowing earlier action to be taken. Submitters’ names are not made publicly available, but could be held by the regulator. Industry is therefore more likely to submit sequences if their describing metadata can be made anonymous. Earlier notification of a link to an outbreak is in everyone’s best interest, and it will be in the submitter’s interest to be removed from the investigative focus if the submitted sequence does not match clinical data. Despite these clear advantages, there is still the worry that a current isolate can be linked to outbreaks that occurred at any time, and that a current outbreak could be linked to an isolate submitted at any time. If the food industry is to use this technique and work constructively with the regulators, these issues need to be addressed and binding assurances given by the regulator that the industry’s desire to protect public health through the use of WGS will not result in an increased probability of prosecution should an unfounded link be made.

Campden BRI is actively working with industry and regulators to advise and help reach a mutually beneficial result. If you would like to explore Whole Genome Sequencing in more depth, please get in touch.

Related organisations

Food & Drug Administration (FDA)

Cookie	Description
cookielawinfo-checkbox-advertising-targeting	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Description
cf_ob_info	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	This cookie is set by Youtube and is used to track the views of embedded videos.

Cookie	Description
bcookie	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	This cookie is set by LinkedIn and used for routing.
lissc	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Description
advanced_ads_browser_width	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Recommended

Whole Genome Sequencing and the food industry

The workflow to assemble a genome is straightforward:

Is this caution warranted?

Related topics

Related organisations

Leave a Reply Cancel reply

Recommended

Whole Genome Sequencing and the food industry

The workflow to assemble a genome is straightforward:

Is this caution warranted?

Related topics

Related organisations

PFAS: Navigating regulations, challenges, risk management and testing in the food supply chain

FSA full steam ahead: the push for approval on new alternative proteins

UK Food Standards Agency awarded £1.4m to support new innovation hub focusing on precision fermentation

Turning waste into food: scientists transform whey into sustainable protein

Understanding the PFAS risk in food supply chains

Leave a Reply Cancel reply