Feng Zhang lab’s recent work published in Science introduces a novel algorithm, fast locality-sensitive hashing-based clustering (FLSHclust), designed for mining large sequencing databases. This approach aims to address the limitations of existing methods in discovering rare protein families due to the exponential growth of databases. The research focused on systematically enumerating CRISPR-linked gene modules across all available sequencing data. CRISPR-Cas systems, known for their role in microbial immune defense and as the basis for genome editing technologies, have recently been linked to diverse biochemical activities like transposition and protease activity. The study hypothesizes that many more enzymatic activities associated with CRISPR systems could be discovered, some of which might be rare in existing databases.

Using FLSHclust, the researchers identified 188 previously unknown CRISPR-associated systems, including several rare ones. They experimentally characterized four new systems: a type IV system with an HNH nuclease domain, demonstrating a specific interference mechanism; two type I systems with HNH domains for precise DNA cleavage, which showed potential for genome editing in human cells; and candidate type VII systems indicating RNA targeting origins from type III-E CRISPR systems. Additionally, they discovered new effector and adaptation components, associations of Mu transposons with CRISPR systems, and proteins related to type V systems. They also noted a potential anti-CRISPR mechanism involving Cas9 and other non-CRISPR repeat arrays.

The study underscores the potential of FLSHclust in efficiently clustering millions of sequences and highlights the discovery of diverse biochemical activities linked to RNA-guided mechanisms in CRISPR systems, opening avenues for biotechnological advancements.

Reference: Altae-Tran H, Kannan S, Suberski AJ, Mears KS, Demircioglu FE, Moeller L, Kocalar S, Oshiro R, Makarova KS, Macrae RK, Koonin EV, Zhang F. Uncovering the functional diversity of rare CRISPR-Cas systems with deep terascale clustering. Science. 2023 Nov 24;382(6673):eadi1910. doi: 10.1126/science.adi1910. Epub 2023 Nov 23. PMID: 37995242.
About PackGene

PackGene is a CRO & CDMO technology company that specializes in packaging recombinant adeno-associated virus (rAAV) vectors. Since its establishment in 2014, PackGene has been a leader in the AAV vector CRO service field, providing tens of thousands of custom batches of AAV samples to customers in over 20 countries. PackGene offers a one-stop CMC solution for the early development, pre-clinical development, clinical trials, and drug approval of rAAV vector drugs for cell and gene therapy (CGT) companies that is fast, cost-effective, high-quality, and scalable. Additionally, the company provides compliant services for the GMP-scale production of AAVs and plasmids for pharmaceutical companies, utilizing five technology platforms, including the π-Alpha 293 cell AAV high-yield platform and the π-Omega plasmid high-yield platform. PackGene's mission is to make gene therapy affordable and accelerate the launch of innovative gene drugs. The company aims to simplify the challenging aspects of gene therapy development and industrialization processes and provide stable, efficient, and economical rAAV Fast Services to accelerate gene and cell therapy development efforts from discovery phase to commercialization.

Related News

How to make mRNA therapeutics safe from the start

The success of mRNA vaccines against COVID-19 has unleashed a flood of interest in using the technology to create more vaccines and treatments for everything from rare diseases and infections to cancer. But before new mRNA therapeutics are put to use, they need to be...

read more