Overcoming genome sequencing gaps to accurately characterise a genetically modified mosquito strain

Posted on August 03, 2022

By Silke Fuchs

Regulatory Science Officer, Imperial College London
Target Malaria

Genome sequencing of malaria vectors has given scientists a roadmap for locating potential transgene insertion sites. However, doing so accurately is hampered by repetitive, unannotated, and fragmented regions in vector genomes. For example, an estimated 33% of the Anopheles gambiae genome is composed of repetitive elements.

In a recent study published in Pathogens and Global Health, my colleagues from Imperial College London, Polo GGB, and King’s College London and I used multiple methods to extensively characterise the non-gene-drive genetically modified male-bias mosquito strain and identify its transgene insertion sites. This strain is an intermediate step towards the development of gene drive mosquitoes as a tool to fight malaria, and the molecular characterisation of the transgene will support the prediction of the dynamics of this modification in potential target field populations.

Figure: FISH on polytene chromosomes from ovarian nurse cells of non-gene-drive male-bias mosquitoes. A shows an overview of dissected Anopheles gambiae autosomes (chromosomes 2R, 2L, 3R, 3L). B shows fluorescence microscopy of the chromosomes (in green) and the signal (in red, indicated by an arrow) that labels the transgene insertion site near the centromere of chromosome 2R.

In this study, we combined several techniques, including novel approaches in sequencing and scaffolding¹: Whole Genome Sequencing (WGS), Southern blotting, fluorescence in situ hybridisation (FISH), and Polymerase Chain Reaction (PCR) analysis. This allowed us to identify a single insertion of the transgene, located on a single chromosome.

The combination of these methods revealed that the modification was on a different chromosome from the one previously described in Galizi, 2014. Based on our findings, we concluded that inverse PCR, which produces relatively short flanking sequences, may be less suitable for correctly identifying transgene landing sites within long, highly repetitive DNA sequences (>700 bp). Further, FISH analysis, which does not rely on DNA sequences, can be a very powerful tool for narrowing down the chromosomal locations of transgenes in unannotated regions.
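To illustrate why a short flanking sequence can point to the wrong location, here is a minimal Python sketch. It is a toy model only: the sequences, their lengths, and the helper `find_all` are invented for illustration and are not taken from the study, which worked with real genome data and dedicated alignment tools. The idea is simply that a flank lying entirely inside a repeat matches several places, while a longer flank that reaches into unique sequence matches only one.

```python
def find_all(genome: str, query: str) -> list[int]:
    """Return every 0-based start position where query occurs (overlaps allowed)."""
    positions = []
    start = genome.find(query)
    while start != -1:
        positions.append(start)
        start = genome.find(query, start + 1)
    return positions


# Hypothetical toy genome: one repeat present at two loci,
# each surrounded by different unique sequence.
REPEAT = "ACGTACGTACGT"
UNIQUE_A = "TTGACCA"
UNIQUE_B = "GGCTTAG"
UNIQUE_C = "CATCGGA"
genome = UNIQUE_A + REPEAT + UNIQUE_B + REPEAT + UNIQUE_C

# Assume the insertion sits just after the second copy of the repeat.
# A short flank covers only repeat sequence ...
short_flank = REPEAT[-8:]
# ... while a longer flank extends back into the unique neighbouring sequence.
long_flank = UNIQUE_B[-4:] + REPEAT

short_hits = find_all(genome, short_flank)
long_hits = find_all(genome, long_flank)

print(f"short flank ({len(short_flank)} bases) matches at {short_hits}")
print(f"long flank ({len(long_flank)} bases) matches at {long_hits}")
```

In this toy case the short flank matches at several positions across both repeat copies, so it cannot tell the two loci apart, whereas the longer flank matches at a single position.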

These findings are important: they show that combining various sequencing and non-sequencing techniques in this way could support others in the development of novel targets for vector control, off-target analysis, and characterisation of insertion sites of genetically modified strains.

The WHO has emphasised in its Guidance Framework for testing of genetically modified mosquitoes that, for any genetically modified organism considered for release, a molecular characterisation of the transgene and its neighbouring DNA regions is essential for safety assessment and post-release monitoring. This study presents a robust method pipeline that could support this effort.

¹ Scaffolding is a technique used in bioinformatics. It is defined as follows: Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contiguous sequences corresponding to read overlaps. Source: EDAM (Bioscientific data analysis ontology), Scaffolding class, NCBO BioPortal (bioontology.org).
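As a rough illustration of the definition above, the Python sketch below joins a few contiguous sequences into one scaffold, filling each gap of known length with the placeholder base `N`. The function name `build_scaffold` and the example sequences are hypothetical; real scaffolding tools also have to work out the order, orientation, and gap sizes, which are simply assumed to be known here.

```python
def build_scaffold(contigs: list[str], gap_lengths: list[int]) -> str:
    """Join contigs in the given order, separated by runs of 'N' of known length."""
    if not contigs:
        raise ValueError("at least one contig is required")
    if len(gap_lengths) != len(contigs) - 1:
        raise ValueError("need exactly one gap length between each pair of contigs")
    if any(gap < 0 for gap in gap_lengths):
        raise ValueError("gap lengths must not be negative")

    parts = [contigs[0]]
    for gap, contig in zip(gap_lengths, contigs[1:]):
        parts.append("N" * gap)  # unknown bases, but the gap size is known
        parts.append(contig)
    return "".join(parts)


# Hypothetical contigs and gap sizes, for illustration only.
scaffold = build_scaffold(["ACGT", "GGA", "TT"], [3, 2])
print(scaffold)
```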
