Search results for: genome rearrangement
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 399

Search results for: genome rearrangement

159 Whole Coding Genome Inter-Clade Comparison to Predict Global Cancer-Protecting Variants

Authors: Lamis Naddaf, Yuval Tabach

Abstract:

In this research, we identified the missense genetic variants that have the potential to enhance resistance against cancer. Such field has not been widely explored, as researchers tend to investigate mutations that cause diseases, in response to the suffering of patients, rather than those mutations that protect from them. In conjunction with the genomic revolution, and the advances in genetic engineering and synthetic biology, identifying the protective variants will increase the power of genotype-phenotype predictions and can have significant implications on improved risk estimation, diagnostics, prognosis and even for personalized therapy and drug discovery. To approach our goal, we systematically investigated the sites of the coding genomes and picked up the alleles that showed a correlation with the species’ cancer resistance. We predicted 250 protecting variants (PVs) with a 0.01 false discovery rate and more than 20 thousand PVs with a 0.25 false discovery rate. Cancer resistance in Mammals and reptiles was significantly predicted by the number of PVs a species has. Moreover, Genes enriched with the protecting variants are enriched in pathways relevant to tumor suppression like pathways of Hedgehog signaling and silencing, which its improper activation is associated with the most common form of cancer malignancy. We also showed that the PVs are more abundant in healthy people compared to cancer patients within different human races.

Keywords: comparative genomics, machine learning, cancer resistance, cancer-protecting alleles

Procedia PDF Downloads 70
158 Start-Up: The Perception of Brazilian Entrepreneurs about the Start-Up Brasil Program

Authors: Fernando Nobre Cavalcante

Abstract:

In Brazil, and more recently in the city of Fortaleza, there is a new form of entrepreneurship that is focused on the information and communication technology service sector and that draws the attention of young people, investors, governments, authors and media companies: it is known as the start-up movement. Today, it is considered to be a driving force behind the creative economy. Rooted on progressive discourse, the words enterprise and innovation seduce new economic agents motivated by success stories from Silicon Valley in America along with increasing commercial activity for digital goods and services. This article assesses, from a sociological point of view, the new productive wave problematized by the light of Manuel Castells’ informational capitalism. Considering the skeptical as well as the optimistic opinions about the impact of this new entrepreneurial rearrangement, the following question is asked: How Brazilian entrepreneurs evaluate public policy incentives for startups Brazilian Federal Government? The raised hypotheses are based on employability factors as well as cultural, economical, and political matters related to innovation and technology. This study has produced a nationwide quantitative assessment with a special focus on the reality of these Ceará firms; as well as comparative qualitative interviews on Brazilian experiences lived by identified agents. This article outlines the public incentive policy of the federal government, the Start-up Brasil Program, from the perspective of these companies and provides details as to the discipline methods of the new enterprising way born in the United States. The startups are very young companies that are headed towards the economic sustainment of the productive sector services. These companies are dropping the seeds that will produce the re-enchantment of young people and bring them back to participation in political debate; they provide relief and reheats the job market; and they produce a democratization of the entrepreneurial ‘Do-It-Yourself’ culture. They capitalize the pivot of the wall street wolves and of agents being charged for new masks. There are developmental logic’s prophylaxis in the face of dreadful innovation stagnation. The lack of continuity in Brazilian governmental politics and cultural nuances related to entrepreneurship are barring the desired regional success of this ecosystem.

Keywords: creative economy, entrepreneurship, informationalism, innovation, startups, start-up brasil program

Procedia PDF Downloads 341
157 Mapping QTLs Associated with Salinity Tolerance in Maize at Seedling Stage

Authors: Mohammad Muhebbullah Ibne Hoque, Zheng Jun, Wang Guoying

Abstract:

Salinity stress is one of the most important abiotic factors contributing to crop growth and yield loss. Exploring the genetic basis is necessary to develop maize varieties with salinity tolerance. In order to discover the inherent basis for salinity tolerance traits in maize, 121 polymorphic SSR markers were used to analyze 163 F2 individuals derived from a single cross of inbred line B73 (a salt susceptible inbred line) and CZ-7 (a salt tolerant inbred line). A linkage map was constructed and the map covered 1195.2 cM of maize genome with an average distance of 9.88 cM between marker loci. Ten salt tolerance traits at seedling stage were evaluated for QTL analysis in maize seedlings. A total of 41 QTLs associated with seedling shoot and root traits were detected, with 16 and 25 QTLs under non-salinity and salinity condition, respectively. And only 4 major stable QTLs were detected in two environments. The detected QTLs were distributed on chromosomes 1, 2, 4, 5, 6, 7, 8, 9, and chromosome 10. Phenotypic variability for the identified QTLs for all the traits was in the range from 6.27 to 21.97%. Fourteen QTLs with more than 10% contributions were observed. Our results and the markers associated with the major QTL detected in this study have the potential application for genetic improvement of salt tolerance in maize through marker-assisted selection.

Keywords: salt tolerance, seedling stage, root shoot traits, quantitative trait loci, simple sequence repeat, maize

Procedia PDF Downloads 275
156 THRAP2 Gene Identified as a Candidate Susceptibility Gene of Thyroid Autoimmune Diseases Pedigree in Tunisian Population

Authors: Ghazi Chabchoub, Mouna Feki, Mohamed Abid, Hammadi Ayadi

Abstract:

Autoimmune thyroid diseases (AITDs), including Graves’ disease (GD) and Hashimoto’s thyroiditis (HT), are inherited as complex traits. Genetic factors associated with AITDs have been tentatively identified by candidate gene and genome scanning approaches. We analysed three intragenic microsatellite markers in the thyroid hormone receptor associated protein 2 gene (THRAP2), mapped near D12S79 marker, which have a potential role in immune function and inflammation [THRAP2-1(TG)n, THRAP2-2 (AC)n and THRAP2-3 (AC)n]. Our study population concerned 12 patients affected with AITDs belonging to a multiplex Tunisian family with high prevalence of AITDs. Fluorescent genotyping was carried out on ABI 3100 sequencers (Applied Biosystems USA) with the use of GENESCAN for semi-automated fragment sizing and GENOTYPER peak-calling software. Statistical analysis was performed using the non parametric Lod score (NPL) by Merlin software. Merlin outputs non-parametric NPLall (Z) and LOD scores and their corresponding asymptotic P values. The analysis for three intragenic markers in the THRAP2 gene revealed strong evidence for linkage (NPL=3.68, P=0.00012). Our results suggested the possible role of THRAP2 gene in AITDs susceptibility in this family.

Keywords: autoimmunity, autoimmune disease, genetic, linkage analysis

Procedia PDF Downloads 96
155 Whole Coding Genome Inter-Clade Comparisons to Predict Global Cancer-Protecting Variants

Authors: Lamis Naddaf, Yuval Tabach

Abstract:

We identified missense genetic variants with the potential to enhance resistance against cancer. Such a field has not been widely explored as researchers tend to investigate the mutations that cause diseases, in response to the suffering of patients, rather than those mutations that protect from them. In conjunction with the genomic revolution and the advances in genetic engineering and synthetic biology, identifying the protective variants will increase the power of genotype-phenotype predictions and have significant implications for improved risk estimation, diagnostics, prognosis, and even personalized therapy and drug discovery. To approach our goal, we systematically investigated the sites of the coding genomes and selected the alleles that showed a correlation with the species’ cancer resistance. Interestingly, we found several amino acids that are more generally preferred (like the Proline) or avoided (like the Cysteine) by the resistant species. Furthermore, Cancer resistance in mammals and reptiles is significantly predicted by the number of the predicted protecting variants (PVs) a species has. Moreover, PVs-enriched-genes are enriched in pathways relevant to tumor suppression. For example, they are enriched in the Hedgehog signaling and silencing pathways, which its improper activation is associated with the most common form of cancer malignancy. We also showed that the PVs are mostly more abundant in healthy people compared to cancer patients within different human races.

Keywords: cancer resistance, protecting variant, naked mole rat, comparative genomics

Procedia PDF Downloads 72
154 Restriction on the Freedom of Economic Activity in the Polish Energy Law

Authors: Zofia Romanowska

Abstract:

Recently there have been significant changes in the Polish energy market. Due to the government's decision to strengthen energy security as well as to strengthen the implementation of the European Union common energy policy, the Polish energy market has been undergoing significant changes. In the face of these, it is necessary to answer the question about the direction the Polish energy rationing sector is going, how wide apart the powers of the state are and also whether the real regulator of energy projects in Poland is not in fact the European Union itself. In order to determine the role of the state as a regulator of the energy market, the study analyses the basic instruments of regulation, i.e. the licenses, permits and permissions to conduct various activities related to the energy market, such as the production and sale of liquid fuels or concessions for trade in natural gas. Bearing in mind that Polish law is part of the widely interpreted European Union energy policy, the legal solutions in neighbouring countries are also being researched, including those made in Germany, a country which plays a key role in the shaping of EU policies. The correct interpretation of the new legislation modifying the current wording of the Energy Law Act, such as obliging the entities engaged in the production and trade of liquid fuels (including abroad) to meet a number of additional requirements for the licensing and providing information to the state about conducted business, plays a key role in the study. Going beyond the legal framework for energy rationing, the study also includes a legal and economic analysis of public and private goods within the energy sector and delves into the subject of effective remedies. The research caused the relationships between progressive rationing introduced by the legislator and the rearrangement rules prevailing on the Polish energy market to be taken note of, which led to the introduction of greater transparency in the sector. The studies refer to the initial conclusion that currently, despite the proclaimed idea of liberalization of the oil and gas market and the opening of market to a bigger number of entities as a result of the newly implanted changes, the process of issuing and controlling the conduction of the concessions will be tightened, guaranteeing to entities greater security of energy supply. In the long term, the effect of the introduced legislative solutions will be the reduction of the amount of entities on the energy market. The companies that meet the requirements imposed on them by the new regulation to cope with the profitability of the business will in turn increase prices for their services, which will be have an impact on consumers' budgets.

Keywords: license, energy law, energy market, public goods, regulator

Procedia PDF Downloads 215
153 Genome-Wide Identification of Genes Resistance to Nitric Oxide in Vibrio parahaemolyticus

Authors: Yantao Li, Jun Zheng

Abstract:

Food poison caused by consumption of contaminated food, especially seafood, is one of most serious public health threats worldwide. Vibrio parahaemolyticus is emerging bacterial pathogen and the leading cause of human gastroenteritis associated with food poison, especially in the southern coastal region of China. To successfully cause disease in host, bacterial pathogens need to overcome the host-derived stresses encountered during infection. One of the toxic chemical species elaborated by the host is nitric oxide (NO). NO is generated by acidified nitrite in the stomach and by enzymes of the inducible NO synthase (iNOS) in the host cell, and is toxic to bacteria. Bacterial pathogens have evolved some mechanisms to battle with this toxic stress. Such mechanisms include genes to sense NO produced from immune system and activate others to detoxify NO toxicity, and genes to repair the damage caused by toxic reactive nitrogen species (RNS) generated during NO toxic stress. However, little is known about the NO resistance in V. parahaemolyticus. In this study, a transposon coupled with next generation sequencing (Tn-seq) technology will be utilized to identify genes for NO resistance in V. parahaemolyticus. Our strategy will include construction the saturating transposon insertion library, transposon library challenging with NO, next generation sequencing (NGS), bioinformatics analysis and verification of the identified genes in vitro and in vivo.

Keywords: vibrio parahaemolyticus, nitric oxide, tn-seq, virulence

Procedia PDF Downloads 243
152 Phylogenetic Analyses of Newcastle Disease Virus Isolated from Unvaccinated Chicken Flocks in Kyrgyzstan from 2015 to 2016

Authors: Giang Tran Thi Huong, Hieu Dong Van, Tung Dao Duy, Saadanov Iskender, Isakeev Mairambek, Tsutomu Omatsu, Yukie Katayama, Tetsuya Mizutani, Yuki Ozeki, Yohei Takeda, Haruko Ogawa, Kunitoshi Imai

Abstract:

Newcastle disease virus (NDV) is a contagious viral disease of the poultry industry and other birds throughout the world. At present, very little is known about molecular epidemiological data regarding the causes of ND outbreak in commercial poultry farms in Kyrgyzstan. In the current study, the NDV isolated from the one out of three samples from the unvaccinated flock was confirmed as NDV. Phylogenetic analysis indicated that this NDV strain is clustered in the Class II subgenotype VIId, and closely related to the Chinese NDV isolate. Phylogenetic analyses revealed that the isolated NDV strain has an origin different from the 4 NDV strains previously identified in Kyrgyzstan. According to the mean death time (MDT: 61.1 h) and a multibasic amino acid (aa) sequence at the F0 proteolytic cleavage site (¹¹²R-R-Q-K-R-F¹¹⁷), the NDV isolate was determined as mesogenic strain. Several mutations in the neutralizing epitopes (notably, ³⁴⁷E→K) and the global head were observed in the hemagglutinin-neuraminidase (HN) protein of the current isolate. The present study represents the molecular characterization of the coding gene region of NDV in Kyrgyzstan. Additionally, further study will be investigated on the antigenic characterization using monoclonal antibody.

Keywords: Kyrgyzstan, Newcastle disease, genotype, genome characterization

Procedia PDF Downloads 113
151 Evaluation and Assessment of Bioinformatics Methods and Their Applications

Authors: Fatemeh Nokhodchi Bonab

Abstract:

Bioinformatics, in its broad sense, involves application of computer processes to solve biological problems. A wide range of computational tools are needed to effectively and efficiently process large amounts of data being generated as a result of recent technological innovations in biology and medicine. A number of computational tools have been developed or adapted to deal with the experimental riches of complex and multivariate data and transition from data collection to information or knowledge. These bioinformatics tools are being evaluated and applied in various medical areas including early detection, risk assessment, classification, and prognosis of cancer. The goal of these efforts is to develop and identify bioinformatics methods with optimal sensitivity, specificity, and predictive capabilities. The recent flood of data from genome sequences and functional genomics has given rise to new field, bioinformatics, which combines elements of biology and computer science. Bioinformatics is conceptualizing biology in terms of macromolecules (in the sense of physical-chemistry) and then applying "informatics" techniques (derived from disciplines such as applied maths, computer science, and statistics) to understand and organize the information associated with these molecules, on a large-scale. Here we propose a definition for this new field and review some of the research that is being pursued, particularly in relation to transcriptional regulatory systems.

Keywords: methods, applications, transcriptional regulatory systems, techniques

Procedia PDF Downloads 94
150 Alloy Design of Single Crystal Ni-base Superalloys by Combined Method of Neural Network and CALPHAD

Authors: Mehdi Montakhabrazlighi, Ercan Balikci

Abstract:

The neural network (NN) method is applied to alloy development of single crystal Ni-base Superalloys with low density and improved mechanical strength. A set of 1200 dataset which includes chemical composition of the alloys, applied stress and temperature as inputs and density and time to rupture as outputs is used for training and testing the network. Thermodynamic phase diagram modeling of the screened alloys is performed with Thermocalc software to model the equilibrium phases and also microsegregation in solidification processing. The model is first trained by 80% of the data and the 20% rest is used to test it. Comparing the predicted values and the experimental ones showed that a well-trained network is capable of accurately predicting the density and time to rupture strength of the Ni-base superalloys. Modeling results is used to determine the effect of alloying elements, stress, temperature and gamma-prime phase volume fraction on rupture strength of the Ni-base superalloys. This approach is in line with the materials genome initiative and integrated computed materials engineering approaches promoted recently with the aim of reducing the cost and time for development of new alloys for critical aerospace components. This work has been funded by TUBITAK under grant number 112M783.

Keywords: neural network, rupture strength, superalloy, thermocalc

Procedia PDF Downloads 282
149 Rapid Detection and Differentiation of Camel Pox, Contagious Ecthyma and Papilloma Viruses in Clinical Samples of Camels Using a Multiplex PCR

Authors: A. I. Khalafalla, K. A. Al-Busada, I. M. El-Sabagh

Abstract:

Pox and pox-like diseases of camels are a group of exanthematous skin conditions that have become increasingly important economically. They may be caused by three distinct viruses: camelpox virus (CMPV), camel contagious ecthyma virus (CCEV) and camel papillomavirus (CAPV). These diseases are difficult to differentiate based on clinical presentation in disease outbreaks. Molecular methods such as PCR targeting species-specific genes have been developed and used to identify CMPV and CCEV, but not simultaneously in a single tube. Recently, multiplex PCR has gained reputation as a convenient diagnostic method with cost- and time–saving benefits. In the present communication, we describe the development, optimization and validation a multiplex PCR assays able to detect simultaneously the genome of the three viruses in one single test allowing for rapid and efficient molecular diagnosis. The assay was developed based on the evaluation and combination of published and new primer sets, and was applied to the detection of 110 tissue samples. The method showed high sensitivity, and the specificity was confirmed by PCR-product sequencing. In conclusion, this rapid, sensitive and specific assay is considered a useful method for identifying three important viruses in specimens from camels and as part of a molecular diagnostic regime.

Keywords: multiplex PCR, diagnosis, pox and pox-like diseases, camels

Procedia PDF Downloads 440
148 Large-scale GWAS Investigating Genetic Contributions to Queerness Will Decrease Stigma Against LGBTQ+ Communities

Authors: Paul J. McKay

Abstract:

Large-scale genome-wide association studies (GWAS) investigating genetic contributions to sexual orientation and gender identity are largely lacking and may reduce stigma experienced in the LGBTQ+ community by providing an underlying biological explanation for queerness. While there is a growing consensus within the scientific community that genetic makeup contributes – at least in part – to sexual orientation and gender identity, there is a marked lack of genomics research exploring polygenic contributions to queerness. Based on recent (2019) findings from a large-scale GWAS investigating the genetic architecture of same-sex sexual behavior, and various additional peer-reviewed publications detailing novel insights into the molecular mechanisms of sexual orientation and gender identity, we hypothesize that sexual orientation and gender identity are complex, multifactorial, and polygenic; meaning that many genetic factors contribute to these phenomena, and environmental factors play a possible role through epigenetic modulation. In recent years, large-scale GWAS studies have been paramount to our modern understanding of many other complex human traits, such as in the case of autism spectrum disorder (ASD). Despite possible benefits of such research, including reduced stigma towards queer people, improved outcomes for LGBTQ+ in familial, socio-cultural, and political contexts, and improved access to healthcare (particularly for trans populations); important risks and considerations remain surrounding this type of research. To mitigate possibilities such as invalidation of the queer identities of existing LGBTQ+ individuals, genetic discrimination, or the possibility of euthanasia of embryos with a genetic predisposition to queerness (through reproductive technologies like IVF and/or gene-editing in utero), we propose a community-engaged research (CER) framework which emphasizes the privacy and confidentiality of research participants. Importantly, the historical legacy of scientific research attempting to pathologize queerness (in particular, falsely equating gender variance to mental illness) must be acknowledged to ensure any future research conducted in this realm does not propagate notions of homophobia, transphobia or stigma against queer people. Ultimately, in a world where same-sex sexual activity is criminalized in 69 UN member states, with 67 of these states imposing imprisonment, 8 imposing public flogging, 6 (Brunei, Iran, Mauritania, Nigeria, Saudi Arabia, Yemen) invoking the death penalty, and another 5 (Afghanistan, Pakistan, Qatar, Somalia, United Arab Emirates) possibly invoking the death penalty, the importance of this research cannot be understated, as finding a biological basis for queerness would directly oppose the harmful rhetoric that “being LGBTQ+ is a choice.” Anti-trans legislation is similarly widespread: In the United States in 2022 alone (as of Oct. 13), 155 anti-trans bills have been introduced preventing trans girls and women from playing on female sports teams, barring trans youth from using bathrooms and locker rooms that align with their gender identity, banning access to gender affirming medical care (e.g., hormone-replacement therapy, gender-affirming surgeries), and imposing legal restrictions on name changes. Understanding that a general lack of knowledge about the biological basis of queerness may be a contributing factor to the societal stigma faced by gender and sexual orientation minorities, we propose the initiation of large-scale GWAS studies investigating the genetic basis of gender identity and sexual orientation.

Keywords: genome-wide association studies (GWAS), sexual and gender minorities (SGM), polygenicity, community-engaged research (CER)

Procedia PDF Downloads 46
147 ZBTB17 Gene rs10927875 Polymorphism in Slovak Patients with Dilated Cardiomyopathy

Authors: I. Boroňová, J. Bernasovská, J. Kmec, E. Petrejčíková

Abstract:

Dilated cardiomyopathy (DCM) is a severe cardiovascular disorder characterized by progressive systolic dysfunction due to cardiac chamber dilatation and inefficient myocardial contractility often leading to chronic heart failure. Recently, a genome-wide association studies (GWASs) on DCM indicate that the ZBTB17 gene rs10927875 single nucleotide polymorphism is associated with DCM. The aim of the study was to identify the distribution of ZBTB17 gene rs10927875 polymorphism in 50 Slovak patients with DCM and 80 healthy control subjects using the Custom Taqman®SNP Genotyping assays. Risk factors detected at baseline in each group included age, sex, body mass index, smoking status, diabetes and blood pressure. The mean age of patients with DCM was 52.9±6.3 years; the mean age of individuals in control group was 50.3±8.9 years. The distribution of investigated genotypes of rs10927875 polymorphism within ZBTB17 gene in the cohort of Slovak patients with DCM was as follows: CC (38.8%), CT (55.1%), TT (6.1%), in controls: CC (43.8%), CT (51.2%), TT (5.0%). The risk allele T was more common among the patients with dilated cardiomyopathy than in normal controls (33.7% versus 30.6%). The differences in genotype or allele frequencies of ZBTB17 gene rs10927875 polymorphism were not statistically significant (p=0.6908; p=0.6098). The results of this study suggest that ZBTB17 gene rs10927875 polymorphism may be a risk factor for susceptibility to DCM in Slovak patients with DCM. Studies of numerous files and additional functional investigations are needed to fully understand the roles of genetic associations.

Keywords: ZBTB17 gene, rs10927875 polymorphism, dilated cardiomyopathy, cardiovascular disorder

Procedia PDF Downloads 381
146 Computational Prediction of the Effect of S477N Mutation on the RBD Binding Affinity and Structural Characteristic, A Molecular Dynamics Study

Authors: Mohammad Hossein Modarressi, Mozhgan Mondeali, Khabat Barkhordari, Ali Etemadi

Abstract:

The COVID-19 pandemic, caused by SARS-CoV-2, has led to significant concerns worldwide due to its catastrophic effects on public health. The SARS-CoV-2 infection is initiated with the binding of the receptor-binding domain (RBD) in its spike protein to the ACE2 receptor in the host cell membrane. Due to the error-prone entity of the viral RNA-dependent polymerase complex, the virus genome, including the coding region for the RBD, acquires new mutations, leading to the appearance of multiple variants. These variants can potentially impact transmission, virulence, antigenicity and evasive immune properties. S477N mutation located in the RBD has been observed in the SARS-CoV-2 omicron (B.1.1. 529) variant. In this study, we investigated the consequences of S477N mutation at the molecular level using computational approaches such as molecular dynamics simulation, protein-protein interaction analysis, immunoinformatics and free energy computation. We showed that displacement of Ser with Asn increases the stability of the spike protein and its affinity to ACE2 and thus increases the transmission potential of the virus. This mutation changes the folding and secondary structure of the spike protein. Also, it reduces antibody neutralization, raising concern about re-infection, vaccine breakthrough and therapeutic values.

Keywords: S477N, COVID-19, molecular dynamic, SARS-COV2 mutations

Procedia PDF Downloads 144
145 Functional Gene Expression in Human Cells Using Linear Vectors Derived from Bacteriophage N15 Processing

Authors: Kumaran Narayanan, Pei-Sheng Liew

Abstract:

This paper adapts the bacteriophage N15 protelomerase enzyme to assemble linear chromosomes as vectors for gene expression in human cells. Phage N15 has the unique ability to replicate as a linear plasmid with telomeres in E. coli during its prophage stage of life-cycle. The virus-encoded protelomerase enzyme cuts its circular genome and caps its ends to form hairpin telomeres, resulting in a linear human-chromosome-like structure in E. coli. In mammalian cells, however, no enzyme with TelN-like activities has been found. In this work, we show for the first-time transfer of the protelomerase from phage into human and mouse cells and demonstrate recapitulation of its activity in these hosts. The function of this enzyme is assayed by demonstrating cleavage of its target DNA, followed by detecting telomere formation based on its resistance to recBCD enzyme digestion. We show protelomerase expression persists for at least 60 days, which indicates limited silencing of its expression. Next, we show that an intact human β-globin gene delivered on this linear chromosome accurately retains its expression in the human cellular environment for at least 60 hours, demonstrating its stability and potential as a vector. These results demonstrate that the N15 protelomerse is able to function in mammalian cells to cut and heal DNA to create telomeres, which provides a new tool for creating novel structures by DNA resolution in these hosts.

Keywords: chromosome, beta-globin, DNA, gene expression, linear vector

Procedia PDF Downloads 163
144 Classification of Multiple Cancer Types with Deep Convolutional Neural Network

Authors: Nan Deng, Zhenqiu Liu

Abstract:

Thousands of patients with metastatic tumors were diagnosed with cancers of unknown primary sites each year. The inability to identify the primary cancer site may lead to inappropriate treatment and unexpected prognosis. Nowadays, a large amount of genomics and transcriptomics cancer data has been generated by next-generation sequencing (NGS) technologies, and The Cancer Genome Atlas (TCGA) database has accrued thousands of human cancer tumors and healthy controls, which provides an abundance of resource to differentiate cancer types. Meanwhile, deep convolutional neural networks (CNNs) have shown high accuracy on classification among a large number of image object categories. Here, we utilize 25 cancer primary tumors and 3 normal tissues from TCGA and convert their RNA-Seq gene expression profiling to color images; train, validate and test a CNN classifier directly from these images. The performance result shows that our CNN classifier can archive >80% test accuracy on most of the tumors and normal tissues. Since the gene expression pattern of distant metastases is similar to their primary tumors, the CNN classifier may provide a potential computational strategy on identifying the unknown primary origin of metastatic cancer in order to plan appropriate treatment for patients.

Keywords: bioinformatics, cancer, convolutional neural network, deep leaning, gene expression pattern

Procedia PDF Downloads 266
143 The Function of Polycomb Repressive Complex 2 (PRC2) In Plant Retrograde Signaling Pathway

Authors: Mingxi Zhou, Jiří Kubásek, Iva Mozgová

Abstract:

In Arabidopsis thaliana, histone 3 lysine 27 tri-methylation catalysed byPRC2 is playing essential functions in the regulation of plant development, growth, and reproduction[1-2]. Despite numerous studies related to the role of PRC2 in developmental control, how PRC2 works in the operational control in plants is unknown. In the present, the evidence that PRC2 probably participates in the regulation of retrograde singalling pathway in Arabidopsisis found. Firstly, we observed that the rosette size and biomass in PRC2-depletion mutants (clf-29 and swn-3) is significantly higher than WTunder medium light condition (ML: 125 µmol m⁻² s⁻²), while under medium high light condition (MHL: 300 µmol m⁻² s-2), the increase was reverse. Under ML condition, the photosynthesis related parameters determined by fluorCam did not show significant differences between WT and mutants, while the pigments concentration increased in the leaf of PRC2-depletion mutants, especially in swn. The dynamic of light-responsive genes and circadian clock genes expression by RT-qPCRwithin 24 hours in the mutants were comparable to WT. However, we observed upregulation of photosynthesis-associated nuclear genes in the PRC2-depletion mutants under chloroplast damaging condition (treated by lincomycin), corresponding to the so-called genome uncoupled (gun) phenotype. Here, we will present our results describing these phenotypes and our suggestion and outlook for studying the involvement of PRC2 in chloroplast-to-nucleus retrograde signalling.

Keywords: PRC2, retrograde signalling, light acclimation, photosyntheis

Procedia PDF Downloads 79
142 Nanowire Substrate to Control Differentiation of Mesenchymal Stem Cells

Authors: Ainur Sharip, Jose E. Perez, Nouf Alsharif, Aldo I. M. Bandeas, Enzo D. Fabrizio, Timothy Ravasi, Jasmeen S. Merzaban, Jürgen Kosel

Abstract:

Bone marrow-derived human mesenchymal stem cells (MSCs) are attractive candidates for tissue engineering and regenerative medicine, due to their ability to differentiate into osteoblasts, chondrocytes or adipocytes. Differentiation is influenced by biochemical and biophysical stimuli provided by the microenvironment of the cell. Thus, altering the mechanical characteristics of a cell culture scaffold can directly influence a cell’s microenvironment and lead to stem cell differentiation. Mesenchymal stem cells were cultured on densely packed, vertically aligned magnetic iron nanowires (NWs) and the effect of NWs on the cell cytoskeleton rearrangement and differentiation were studied. An electrochemical deposition method was employed to fabricate NWs into nanoporous alumina templates, followed by a partial release to reveal the NW array. This created a cell growth substrate with free-standing NWs. The Fe NWs possessed a length of 2-3 µm, with each NW having a diameter of 33 nm on average. Mechanical stimuli generated by the physical movement of these iron NWs, in response to a magnetic field, can stimulate osteogenic differentiation. Induction of osteogenesis was estimated using an osteogenic marker, osteopontin, and a reduction of stem cell markers, CD73 and CD105. MSCs were grown on the NWs, and fluorescent microscopy was employed to monitor the expression of markers. A magnetic field with an intensity of 250 mT and a frequency of 0.1 Hz was applied for 12 hours/day over a period of one week and two weeks. The magnetically activated substrate enhanced the osteogenic differentiation of the MSCs compared to the culture conditions without magnetic field. Quantification of the osteopontin signal revealed approximately a seven-fold increase in the expression of this protein after two weeks of culture. Immunostaining staining against CD73 and CD105 revealed the expression of antibodies at the earlier time point (two days) and a considerable reduction after one-week exposure to a magnetic field. Overall, these results demonstrate the application of a magnetic NW substrate in stimulating the osteogenic differentiation of MSCs. This method significantly decreases the time needed to induce osteogenic differentiation compared to commercial biochemical methods, such as osteogenic differentiation kits, that usually require more than two weeks. Contact-free stimulation of MSC differentiation using a magnetic field has potential uses in tissue engineering, regenerative medicine, and bone formation therapies.

Keywords: cell substrate, magnetic nanowire, mesenchymal stem cell, stem cell differentiation

Procedia PDF Downloads 169
141 TCTN2 Maintains the Transition Zone Stability and Controls the Entrance of the Ciliary Membrane Protein into Primary Cilia

Authors: Rueyhung Weng, Chia-En Huang, Jung-Chi-Liao

Abstract:

The transition zone (TZ) serves as a diffusion barrier to regulate the ins and outs of the proteins recruited to the primary cilia. TCTN2 is one of the TZ proteins and its mutation causes Joubert syndrome, a serious multi-organ disease. Despite its important medical relevance, the functions of TCTN2 remain elusive. Here we created a TCTN2 gene deleted retinal pigment epithelial cells (RPE1) using CRISPR/Cas9-based genome editing technique and used this knockout line to reveal roles of TCTN2. TCTN2 knockout RPE1 cells displayed a significantly reduced ciliogenesis or a shortened primary cilium length in the cilium-remaining population. Intraflagellar transport protein IFT88 aberrantly accumulated at the tip of TCTN2 deficient cells. Guanine nucleotide exchange factor Arl13B was mostly absent from the ciliary compartment, with a small population localizing at the ciliary tip. The deficient TZ was corroborated with the mislocalization of two other TZ proteins TMEM67 and MKS1. In addition, TCTN2 deficiency induced TZ impairment led to the suppression of Sonic hedgehog signaling in response to Smoothened (Smo) agonist. Together, depletion of TCTN2 destabilizes other TZ proteins and considerably alters the localization of key transport and signaling-associated proteins, including IFT88, Arl13B, and Smo.

Keywords: CRISPR/Cas9, primary cilia, Sonic hedgehog signaling, transition zone

Procedia PDF Downloads 313
140 Use of a New Multiplex Quantitative Polymerase Chain Reaction Based Assay for Simultaneous Detection of Neisseria Meningitidis, Escherichia Coli K1, Streptococcus agalactiae, and Streptococcus pneumoniae

Authors: Nastaran Hemmati, Farhad Nikkhahi, Amir Javadi, Sahar Eskandarion, Seyed Mahmuod Amin Marashi

Abstract:

Neisseria meningitidis, Escherichia coli K, Streptococcus agalactiae, and Streptococcus pneumoniae cause 90% of bacterial meningitis. Almost all infected people die or have irreversible neurological complications. Therefore, it is essential to have a diagnostic kit with the ability to quickly detect these fatal infections. The project involved 212 patients from whom cerebrospinal fluid samples were obtained. After total genome extraction and performing multiplex quantitative polymerase chain reaction (qPCR), the presence or absence of each infectious factor was determined by comparing with standard strains. The specificity, sensitivity, positive predictive value, and negative predictive value calculated were 100%, 92.9%, 50%, and 100%, respectively. So, due to the high specificity and sensitivity of the designed primers, they can be used instead of bacterial culture that takes at least 24 to 48 hours. The remarkable benefit of this method is associated with the speed (up to 3 hours) at which the procedure could be completed. It is also worth noting that this method can reduce the personnel unintentional errors which may occur in the laboratory. On the other hand, as this method simultaneously identifies four common factors that cause bacterial meningitis, it could be used as an auxiliary method diagnostic technique in laboratories particularly in cases of emergency medicine.

Keywords: cerebrospinal fluid, meningitis, quantitative polymerase chain reaction, simultaneous detection, diagnosis testing

Procedia PDF Downloads 83
139 A Report of 5-Months-Old Baby with Balanced Chromosomal Rearrangements along with Phenotypic Abnormalities

Authors: Mohit Kumar, Beklashwar Salona, Shiv Murti, Mukesh Singh

Abstract:

We report here a case of five-months old male baby, born as second child of non-consanguineous parents with no considerable history of genetic abnormality which was referred to our cytogenetic laboratory for chromosomal analysis. Physical dysmorphic facial features including mongoloid face, cleft palate, simian crease, and developmental delay were observed. We present this case with unique balanced autosomal translocation of t(3;10)(p21;p13). The risk of phenotypic abnormalities based on de novo balanced translocation was estimated to be 7%. The association of balanced chromosomal rearrangement with Down syndrome features such as multiple congenital anomalies, facial dysmorphism and congenital heart anomalies are very rare in a 5-months old male child. Trisomy-21 is not uncommon in chromosomal abnormality with the birth defect and balanced translocations are frequently observed in patients with secondary infertility or recurrent spontaneous abortion (RSA). Two ml heparinized peripheral blood cells cultured in RPMI-1640 for 72 hours supplemented with 20% fetal bovine serum, phytohemagglutinin (PHA), and antibiotics were used for chromosomal analysis. A total 30 metaphases images were captured using Olympus-BX51 microscope and analyzed using Bio-view karyotyping software through GTG-banding (G bands by trypsin and Giemsa) according to International System for Human Cytogenetic Nomenclature 2016. The results showed balanced translocation between short arm of chromosome # 3 and short arm of chromosome # 10. The karyotype of the child was found to be 46,XY,t(3;10)(p21; p13). Chromosomal abnormalities are one of the major causes of birth defect in new born babies. Also, balanced translocations are frequently observed in patients with secondary infertility or recurrent spontaneous abortion. The index case presented with dysmorphic facial features and had a balanced translocation 46,XY,t(3;10)(p21;p13). This translocation with break points at (p21; p13) has not been reported in the literature in a child with facial dysmorphism. To the best of our knowledge, this is the first report of novel balanced translocation t(3;10) with break points in a child with dysmorphic features. We found balanced chromosomal translocation instead of any trisomy or unbalanced aberrations along with some phenotypic abnormalities. Therefore, we suggest that such novel balanced translocation with abnormal phenotype should be reported in order to enable the pathologist, pediatrician, and gynecologist to have a better insight into the intricacies of chromosomal abnormalities and their associated phenotypic features. We hypothesized that dysmorphic features as seen in this case may be the result of change in the pattern of genes located at the breakpoint area in balanced translocations or may be due to deletion or mutation of genes located on the p-arm of chromosome # 3 and p-arm of chromosome # 10.

Keywords: balanced translocation, karyotyping, phenotypic abnormalities, facial dimorphisms

Procedia PDF Downloads 177
138 Allelic Diversity of Productive, Reproductive and Fertility Traits Genes of Buffalo and Cattle

Authors: M. Moaeen-ud-Din, G. Bilal, M. Yaqoob

Abstract:

Identification of genes of importance regarding production traits in buffalo is impaired by a paucity of genomic resources. Choice to fill this gap is to exploit data available for cow. The cross-species application of comparative genomics tools is potential gear to investigate the buffalo genome. However, this is dependent on nucleotide sequences similarity. In this study gene diversity between buffalo and cattle was determined by using 86 gene orthologues. There was about 3% difference in all genes in term of nucleotide diversity; and 0.267±0.134 in amino acids indicating the possibility for successfully using cross-species strategies for genomic studies. There were significantly higher non synonymous substitutions both in cattle and buffalo however, there was similar difference in term of dN – dS (4.414 vs 4.745) in buffalo and cattle respectively. Higher rate of non-synonymous substitutions at similar level in buffalo and cattle indicated a similar positive selection pressure. Results for relative rate test were assessed with the chi-squared test. There was no significance difference on unique mutations between cattle and buffalo lineages at synonymous sites. However, there was a significance difference on unique mutations for non synonymous sites indicating ongoing mutagenic process that generates substitutional mutation at approximately the same rate at silent sites. Moreover, despite of common ancestry, our results indicate a different divergent time among genes of cattle and buffalo. This is the first demonstration that variable rates of molecular evolution may be present within the family Bovidae.

Keywords: buffalo, cattle, gene diversity, molecular evolution

Procedia PDF Downloads 457
137 Identification of Two Novel Carbapenemase Gene Variants from a Carbapenem-Resistant Aeromonas Veronii Environmental Isolate

Authors: Rafael Estrada, Cristian Ruiz Rueda

Abstract:

Carbapenems are last-resort antibiotics used in clinical settings to treat antibiotic-resistant bacterial infections. Thus, the emergence and spread of resistance to carbapenems is a major public health concern. Here, we have studied a carbapenem-resistant Aeromonas veronii strain previously isolated from a water sample from Sam Simeon Creek (Hearst San Simeon State Park, CA). Analysis of this isolate using disk-diffusion, CarbaNP, eCIM and mCIM assays revealed that it was resistant to amoxicillin-clavulanic acid and all carbapenems tested and that this isolate produced a potentially novel carbapenemase of the Metallo-β-lactamase family. Whole genome sequencing analysis revealed that this A. veronii isolate carries a novel variant of the blacₚₕₐ class β-carbapenemase gene that was closely related to the blacₚₕₐ₇ gene of Aeromonas jandaei. This isolate also carried a novel variant of the blaₒₓₐ class D carbapenemase gene that was most closely related to the blaₒₓₐ-₉₁₂ gene found in other Aeromonas veronii isolates. Finally, we also identified a novel class C β-lactamase gene moderately related to the blaFₒₓ-₁₇ gene of Providencia stuartii and other blaFₒₓ variants identified in Klebsiella pneumoniae, Escherichia coli and other Enterobacteriaceae. Overall, our findings reveal that environmental isolates are an important reservoir of multiple carbapenemases and other β-lactamases of clinical significance.

Keywords: β-lactamases, carbapenem, antibiotic-resistant, aeromonas veronii

Procedia PDF Downloads 54
136 A Pipeline for Detecting Copy Number Variation from Whole Exome Sequencing Using Comprehensive Tools

Authors: Cheng-Yang Lee, Petrus Tang, Tzu-Hao Chang

Abstract:

Copy number variations (CNVs) have played an important role in many kinds of human diseases, such as Autism, Schizophrenia and a number of cancers. Many diseases are found in genome coding regions and whole exome sequencing (WES) is a cost-effective and powerful technology in detecting variants that are enriched in exons and have potential applications in clinical setting. Although several algorithms have been developed to detect CNVs using WES and compared with other algorithms for finding the most suitable methods using their own samples, there were not consistent datasets across most of algorithms to evaluate the ability of CNV detection. On the other hand, most of algorithms is using command line interface that may greatly limit the analysis capability of many laboratories. We create a series of simulated WES datasets from UCSC hg19 chromosome 22, and then evaluate the CNV detective ability of 19 algorithms from OMICtools database using our simulated WES datasets. We compute the sensitivity, specificity and accuracy in each algorithm for validation of the exome-derived CNVs. After comparison of 19 algorithms from OMICtools database, we construct a platform to install all of the algorithms in a virtual machine like VirtualBox which can be established conveniently in local computers, and then create a simple script that can be easily to use for detecting CNVs using algorithms selected by users. We also build a table to elaborate on many kinds of events, such as input requirement, CNV detective ability, for all of the algorithms that can provide users a specification to choose optimum algorithms.

Keywords: whole exome sequencing, copy number variations, omictools, pipeline

Procedia PDF Downloads 284
135 Correlation of P53 Gene Expression With Serum Alanine Transaminase Levels and Hepatitis B Viral Load in Cirrhosis and Hepatocellular Carcinoma Patients

Authors: Umme Shahera, Saifullah Munshi, Munira Jahan, Afzalun Nessa, Shahinul Alam, Shahina Tabassum

Abstract:

The development of HCC is a multi-stage process. Several extrinsic factors, such as aflatoxin, HBV, nutrition, alcohol, and trace elements are thought to initiate or/and promote the hepatocarcinogenesis. Alteration of p53 status is an important intrinsic factor in this process as p53 is essential for preventing inappropriate cell proliferation and maintaining genome integrity following genotoxic stress. This study was designed to assess the correlation of p53 gene expression with HBV-DNA and serum Alanine transaminase (ALT) in patients with cirrhosis and HCC. The study was conducted among 60 patients. The study population were divided into four groups (15 in each groups)-HBV positive cirrhosis, HBV negative cirrhosis, HBV positive HCC and HBV negative HCC. Expression of p53 gene was observed using real time PCR. P53 gene expressions in the above mentioned groups were correlated with serum ALT level and HBV viral load. p53 gene was significantly higher in HBV-positive patients with HCC than HBV-positive cirrhosis. Similarly, the expression of p53 was significantly higher in HBV-positive HCC than HBV-negative HCC patients. However, the expression of p53 was reduced in HBV-positive cirrhosis in comparison with HBV-negative cirrhosis. P53 gene expression in liver was not correlated with the serum levels of ALT in any of the study groups. HBV- DNA load also did not correlated with p53 gene expression in HBV positive HCC and HBV positive cirrhosis patients. This study shows that there was no significant change with the expression of p53 gene in any of the study groups with ALT level or viral load, though differential expression of p53 gene were observed in cirrhosis and HCC patients.

Keywords: P53, ALT, HBV-DNA, liver cirrhosis, hepatocellular carcinoma

Procedia PDF Downloads 66
134 Insight into Structure and Functions of of Acyl CoA Binding Protein of Leishmania major

Authors: Rohit Singh Dangi, Ravi Kant Pal, Monica Sundd

Abstract:

Acyl-CoA binding protein (ACBP) is a housekeeping protein which functions as an intracellular carrier of acyl-CoA esters. Given the fact that the amastigote stage (blood stage) of Leishmania depends largely on fatty acids as the energy source, of which a large part is derived from its host, these proteins might have an important role in its survival. In Leishmania major, genome sequencing suggests the presence of six ACBPs, whose function remains largely unknown. For functional and structural characterization, one of the ACBP genes was cloned, and the protein was expressed and purified heterologously. Acyl-CoA ester binding and stoichiometry were analyzed by isothermal titration calorimetry and Dynamic light scattering. Our results shed light on high affinity of ACBP towards longer acyl-CoA esters, such as myristoyl-CoA to arachidonoyl-CoA with single binding site. To understand the binding mechanism & dynamics, Nuclear magnetic resonance assignments of this protein are being done. The protein's crystal structure was determined at 1.5Å resolution and revealed a classical topology for ACBP, containing four alpha-helical bundles. In the binding pocket, the loop between the first and the second helix (16 – 26AA) is four residues longer from other extensively studied ACBPs (PfACBP) and it curls upwards towards the pantothenate moiety of CoA to provide a large tunnel space for long acyl chain insertion.

Keywords: acyl-coa binding protein (ACBP), acyl-coa esters, crystal structure, isothermal titration, calorimetry, Leishmania

Procedia PDF Downloads 418
133 Design and in Slico Study of the Truncated Spike-M-N SARS-CoV-2 as a Novel Effective Vaccine Candidate

Authors: Aghasadeghi MR., Bahramali G., Sadat SM., Sadeghi SA., Yousefi M., Khodaei K., Ghorbani M., Sadat Larijani M.

Abstract:

Background:The emerging COVID-19 pandemic is a serious concernfor the public health worldwide. Despite the many mutations in the virus genome, it is important to find an effective vaccine against viral mutations. Therefore, in current study, we aimed at immunoinformatic evaluation of the virus proteins immunogenicity to design a preventive vaccine candidate, which could elicit humoral and cellular immune responses as well. Methods:Three antigenic regions are included;Spike, Membrane, and Nucleocapsid amino acid sequences were obtained, and possible fusion proteins were assessed andcompared by immunogenicity, structural features, and population coverage. The best fusion protein was also evaluated for MHC-I and MHC-II T-cell epitopes and the linear and conformational B-cell epitopes. Results: Among the four predicted models, the truncated Spike protein in fusion with M and N proteins is composed of 24 highly immunogenic human MHC class I and 29 MHC class II, along with 14 B-cell linear and 61 discontinues epitopes. Also, the selected protein has high antigenicity and acceptable population coverage of 82.95% in Iran and 92.51% in Europe. Conclusion: The data indicate that the truncated Spike-M-N SARS-CoV-2form which could be potential targets of neutralizing antibodies. The protein also has the ability to stimulate humoral and cellular immunity. The in silico study provided the fusion protein as a potential preventive vaccine candidate for further in vivo evaluation.

Keywords: SARS-CoV-2, immunoinformatic, protein, vaccine

Procedia PDF Downloads 187
132 Gold Nano Particle as a Colorimetric Sensor of HbA0 Glycation Products

Authors: Ranjita Ghoshmoulick, Aswathi Madhavan, Subhavna Juneja, Prasenjit Sen, Jaydeep Bhattacharya

Abstract:

Type 2 diabetes mellitus (T2DM) is a very complex and multifactorial metabolic disease where the blood sugar level goes up. One of the major consequence of this elevated blood sugar is the formation of AGE (Advance Glycation Endproducts), from a series of chemical or biochemical reactions. AGE are detrimental because it leads to severe pathogenic complications. They are a group of structurally diverse chemical compounds formed from nonenzymatic reactions between the free amino groups (-NH2) of proteins and carbonyl groups (>C=O) of reducing sugars. The reaction is known as Maillard Reaction. It starts with the formation of reversible schiff’s base linkage which after sometime rearranges itself to form Amadori Product along with dicarbonyl compounds. Amadori products are very unstable hence rearrangement goes on until stable products are formed. During the course of the reaction a lot of chemically unknown intermediates and reactive byproducts are formed that can be termed as Early Glycation Products. And when the reaction completes, structurally stable chemical compounds are formed which is termed as Advanced Glycation Endproducts. Though all glycation products have not been characterized well, some fluorescence compounds e.g pentosidine, Malondialdehyde (MDA) or carboxymethyllysine (CML) etc as AGE and α-dicarbonyls or oxoaldehydes such as 3-deoxyglucosone (3-DG) etc as the intermediates have been identified. In this work Gold NanoParticle (GNP) was used as an optical indicator of glycation products. To achieve faster glycation kinetics and high AGE accumulation, fructose was used instead of glucose. Hemoglobin A0 (HbA0) was fructosylated by in-vitro method. AGE formation was measured fluorimetrically by recording emission at 450nm upon excitation at 350nm. Thereafter this fructosylated HbA0 was fractionated by column chromatography. Fractionation separated the proteinaceous substance from the AGEs. Presence of protein part in the fractions was confirmed by measuring the intrinsic protein fluorescence and Bradford reaction. GNPs were synthesized using the templates of chromatographically separated fractions of fructosylated HbA0. Each fractions gave rise to GNPs of varying color, indicating the presence of distinct set of glycation products differing structurally and chemically. Clear solution appeared due to settling down of particles in some vials. The reactive groups of the intermediates kept the GNP formation mechanism on and did not lead to a stable particle formation till Day 10. Whereas SPR of GNP showed monotonous colour for the fractions collected in case of non fructosylated HbA0. Our findings accentuate the use of GNPs as a simple colorimetric sensing platform for the identification of intermediates of glycation reaction which could be implicated in the prognosis of the associated health risk due to T2DM and others.

Keywords: advance glycation endproducts, glycation, gold nano particle, sensor

Procedia PDF Downloads 279
131 Genomic Sequence Representation Learning: An Analysis of K-Mer Vector Embedding Dimensionality

Authors: James Jr. Mashiyane, Risuna Nkolele, Stephanie J. Müller, Gciniwe S. Dlamini, Rebone L. Meraba, Darlington S. Mapiye

Abstract:

When performing language tasks in natural language processing (NLP), the dimensionality of word embeddings is chosen either ad-hoc or is calculated by optimizing the Pairwise Inner Product (PIP) loss. The PIP loss is a metric that measures the dissimilarity between word embeddings, and it is obtained through matrix perturbation theory by utilizing the unitary invariance of word embeddings. Unlike in natural language, in genomics, especially in genome sequence processing, unlike in natural language processing, there is no notion of a “word,” but rather, there are sequence substrings of length k called k-mers. K-mers sizes matter, and they vary depending on the goal of the task at hand. The dimensionality of word embeddings in NLP has been studied using the matrix perturbation theory and the PIP loss. In this paper, the sufficiency and reliability of applying word-embedding algorithms to various genomic sequence datasets are investigated to understand the relationship between the k-mer size and their embedding dimension. This is completed by studying the scaling capability of three embedding algorithms, namely Latent Semantic analysis (LSA), Word2Vec, and Global Vectors (GloVe), with respect to the k-mer size. Utilising the PIP loss as a metric to train embeddings on different datasets, we also show that Word2Vec outperforms LSA and GloVe in accurate computing embeddings as both the k-mer size and vocabulary increase. Finally, the shortcomings of natural language processing embedding algorithms in performing genomic tasks are discussed.

Keywords: word embeddings, k-mer embedding, dimensionality reduction

Procedia PDF Downloads 96
130 Non-Invasive Pre-Implantation Genetic Assessment Using NGS in IVF Clinical Routine

Authors: Katalin Gombos, Bence Gálik, Krisztina Ildikó Kalács, Krisztina Gödöny, Ákos Várnagy, József Bódis, Attila Gyenesei, Gábor L. Kovács

Abstract:

Although non-invasive pre-implantation genetic testing for aneuploidy (NIPGT-A) is potentially appropriate to assess chromosomal ploidy of the embryo, practical application of it in a routine IVF center has not been started in the absence of a recommendation. We developed a comprehensive workflow for a clinically applicable strategy for NIPGT-A based on next-generation sequencing (NGS) technology. We performed MALBAC whole genome amplification and NGS on spent blastocyst culture media of Day 3 embryos fertilized with intra-cytoplasmic sperm injection (ICSI). Spent embryonic culture media of morphologically good quality score embryos were enrolled in further analysis with the blank culture media as background control. Chromosomal abnormalities were identified by an optimized bioinformatics pipeline applying a copy number variation (CNV) detecting algorithm. We demonstrate a comprehensive workflow covering both wet- and dry-lab procedures supporting a clinically applicable strategy for NIPGT-A. It can be carried out within 48 h which is critical for the same-cycle blastocyst transfer, but also suitable for “freeze all” and “elective frozen embryo” strategies. The described integrated approach of non-invasive evaluation of embryonic DNA content of the culture media can potentially supplement existing pre-implantation genetic screening methods.

Keywords: next generation sequencing, in vitro fertilization, embryo assessment, non-invasive pre-implantation genetic testing

Procedia PDF Downloads 130