The gene sequencing analysis based on the FANSE algorithm analysis cloud platform is successful on AWS
Author:China Economic Network Time:2022.09.21
Recently, Chengqi Bio and Amazon Cloud Services (AWS) successfully deployed a full -gene sequencing analysis cloud platform based on the FANSE algorithm on AWS, and opened to the world for free. Chengqi Bio will rely on the cloud computing acceleration system built by AWS to give full play to the performance advantages of the FANSE algorithm accurate, efficient, convenient and fast, strong scalability and other performance advantages to serve the world's gene sequencing enterprises and scientific research institutions. The operation of the FANSE algorithm on the AWS platform also marks that the precise group -based technical solutions independently developed by China have gone abroad to the world and contribute to the development of precision medical care.
Slow speed, the shortcomings of the traditional algorithm cannot be ignored
With the comprehensive arrival of the digital era, more and more companies have begun to migrate the application to the cloud, and gradually deepen from the periphery -type application to the core business systems such as production and decision -making. AWS is the world's largest cloud service provider. Millions of companies around the world provide cloud infrastructure with high security, strong scalability, and high reliability. At the same time, it also provides more than 200 functional services from global data centers. High overall network quality, low latency, low data packet loss, high application flexibility, etc., meet the diversified needs of different institutional enterprises such as government departments, traditional enterprises, old -fashioned Internet companies, and entrepreneurial technology companies. Due to the characteristics of good cloud computing elasticity, large bandwidth, high computing power, and paying on demand, it seems to be very suitable for gene sequencing analysis. On the AWS platform, there are indeed some large -scale sequencing analysis algorithms in the AWS platform, such as some BWT -based based on BWT -based The algorithm, but there are few practical applications in precision medicine and scientific research, because the application experience is actually not good. It is mainly concentrated in the slow speed and not allowing two issues.
The current mainstream second -generation gene sequencing is to randomly interrupt DNA or RNA into countless small fragments for parallel sequencing. In 1/4, it still takes a few hours to upload, and it has to be decompressed after passing. Subsequently, a large number of calculations such as sequence filtering, sequence comparison, statistical inspection, and database matching are needed to obtain meaningful test results. Traditional algorithm operations are not high. For example, genome mutation search often takes dozens of hours to run the entire process to complete the entire process. Essence In order to increase the speed of the algorithm, domestic cloud computing service providers have deployed FPGA hardware acceleration gene sequencing analysis systems. Nevertheless, its single -tasking processing speed is still longer. For example, analyzing the complete genealogy data set (regardless of network transmission) to complete a person's whole genome (regardless of network transmission), it still takes nearly 2 hours. need. In addition, the parameters of traditional sequencing analysis algorithms are complex. If there is no corresponding professional knowledge and experience trial and error, it is not easy to set the optimized parameter, which directly affects the detection rate and accuracy. Therefore, the company would rather buy the expensive server cluster and spend a lot of money to hire a letter analyst to analyze it locally, and it is rarely willing to use the cloud platform in actual business.
FANSE is launched on the AWS public cloud platform as the gene sequencing industry cost reduction and efficiency
The sensitivity comparison of FANSE (black line) and two international common algorithms (green and blue lines) on the test data set of body cell mutation standard
The FANSE algorithm is developed by Chengqi Bio. After many updates and iterations, it has now developed to the fourth generation. In common applications such as the mutation search and transcription group, its accuracy and stability have significantly surpassed the traditional algorithm. The comparison algorithm with the highest stability and accuracy so far. In terms of operation speed, it has set a world record for a 5 -minute analysis of a 30X full -genome sequencing data set for 5 minutes. Cheng Qi also independently developed a compression algorithm for FANSE, which can compress the sequencing data to up to 1/20 for transmission. It has reduced the time consuming network transmission, and it can be processed by FANSE without decompression. The private cloud platform based on the FANSE algorithm self -built by Chengqi Biological shows excellent performance. Users do not have to buy servers, do not have to master difficult biological information knowledge. You can complete the sequencing analysis to get a stable and accurate result. However, due to the limitation of bandwidth in private clouds, as the number of customers using Chengqi Cloud analysis increases, the data "tie" cannot be passed, and the bandwidth is "squeezed". The precise algorithm also lost its use of martial arts.
Today, the gene sequencing analysis cloud platform "moved" based on the FANSE algorithm to the AWS public cloud platform. The first thing that solves is the problem of network bandwidth. The total network of the public cloud distributed network is extremely wide, and it can carry a lot of users to upload the massive data. This is undoubtedly "tiger" for the FANSE algorithm. Its extremely efficient advantage can be fully displayed under the characteristics of public cloud elasticity:: Single tasks are fast to complete, and small -scale applications can be obtained for a moment after uploading. Large applications, such as all -genome sequencing analysis, only need to call more calculation cores. And FANSE does not need any hardware such as FPGA, GPU, etc. at all. It can achieve such a high speed by relying on the CPU computing. The universality is better. Cloud service providers do not need to configure special hardware. It can run well in existing hardware. Easy to upgrade to adapt to endless new applications. Secondly, the successful operation of the gene sequencing analysis of the FANSE algorithm on the AWS can allow gene sequencing enterprises and scientific research institutions from the world to enjoy accurate and efficient analysis services under the requirements of laws and regulations that meet sensitive data from various countries. Previously, before, before, before, before, before, before, before, before, before, before, before, before, before, before, before, before, before. Due to human genetic resources, many countries and regional governments legislative stipulate that gene sequencing data and samples are not allowed to leave the country, which also makes many overseas enterprises and scientific research institutions cannot apply the FANSE cloud platform for gene sequencing data analysis. Because the AWS platform has data centers in various countries, which perfectly meets the requirements of laws and regulations, it can allow the world to obtain gene sequencing analysis services based on the FANSE algorithm, thereby promoting the rapid development of global gene sequencing and precision medical industry.
As far as enterprises are concerned, FANSE's successful operation in AWS can achieve cost reduction and efficiency, and for inheritance creatures, in today's international context, pure domestic independent research and development technology can obtain the world's largest cloud service provider Highly recognized and global deployment is a good start of the Sino -US gene sequencing industry's reverse technology overflow effect. In the future, inheritance biology will continue to cultivate the field of learning technology to promote industry development with more domestic innovation technology, and on the world stage on the world stage Make more Chinese voices and empower "more accurate precision medicine".
Introduction to Shenzhen Chengqi Biotechnology Co., Ltd.
Chengqi creatures are comprehensive and precise medical platforms based on autonomous core technology "multi -group+informatics" to provide medical services, IVDs and treatment programs. They are committed to providing people with precision medical and health management solutions for people with frontier technology. There are four national high -tech enterprises and one licensed and inspection centers.
Chengqi creatures have a completely high -precision gene sequencing data analysis FANSE algorithm that has a complete independent research and development and obtained highly recognized internationally recognized. The FANSE algorithm set a world record of algorithm accuracy and speed in 2020, and was recommended as the core pillar of the international human protein group plan. Chengqi Bio also established the first domestic full -gene detection process. In the genome, transcription group, translation group, protein group, and metabolic group, there are accurate autonomous techniques. This process is used as a national medical life group learning Blueprint of quality control standards.
- END -
Tencent, Baidu fancy layout CRM
The picture comes from Canva.As early as 2019, almost all Internet giants in China...
At 8 o'clock tonight!Double coupons to snatch!
2022 Huimou Hubei consumer couponSince the first batch of distribution on June 13t...