We are standing at an intriguing juncture where the boundaries between biology and machine learning are increasingly blurred, paving the way for solutions that were unthinkable just a few years ago. Among these, the combination of protein folding models and latent diffusion techniques has become a compelling area of exploration. For entrepreneurs and innovators in life sciences or deeptech, this presents a growing opportunity to introduce novel solutions in synthetic biology, drug discovery, and biopharmaceuticals.
The idea of combining protein folding models, which predict the structure of proteins from their amino acid sequences, with latent diffusion isn’t new. It builds on years of foundational research, including significant advancements like DeepMind’s AlphaFold2. But now, we’re not just forecasting; we’re generating entirely new structures with real-world applications. The acceleration this brings to biotech startups is impossible to ignore. And yet, with every opportunity comes challenges that only creative thinking and business risk-taking can address. Let’s navigate this groundbreaking field together.
The core concept of latent diffusion for protein generation
Startups are often about doing more with less. Traditional protein engineering methods require extensive, expensive wet-lab experiments. That’s where this innovation holds a disruptive potential. By learning from the compressed “latent space” of protein structures and integrating the mathematics of diffusion, algorithms can now design protein structures rather than merely predicting them. These experiments rely on latent diffusion to iteratively transform random data into biologically functional designs. One, this reduces experimental costs. Two, it opens the door to creating protein structures that wouldn’t naturally occur.
Here’s the practical lens: instead of spending millions of dollars on biological trials, startups can first experiment in silico. If proteins could be generated computationally based on specific functional requirements, entire industries like enzyme manufacturing or biopharma could innovate faster than ever. Platforms like PLAID from Berkeley AI Research Lab are leading this shift by providing tools to simultaneously generate all-atom protein structures, saving startups months, if not years, of development time.
What makes this relevant now?
Let’s talk numbers. More than $25 billion has already been invested globally in companies leveraging AI for drug discovery, with projections indicating a compound annual growth rate of 35%. Emerging companies like Insitro and Recursion Pharmaceuticals, now worth billions, are already demonstrating that AI can find drug targets or optimize lead compounds more effectively than legacy methods. But when it comes to de novo protein design, where entirely new proteins are created from scratch, we’re at the starting line of what could be an equally dramatic leap.
Moreover, with the massive datasets produced by prior generations of protein folding models, like those from AlphaFold2 or ESMFold, the computational knowledge needed to succeed is finally hitting critical mass. For new entrants into the field, this means lower data acquisition costs, another critical factor for businesses looking to balance innovation costs.
How entrepreneurs can approach this field
Transitioning from predictive AI to generative approaches has its hurdles. While machine learning has reduced entry barriers in biotech, you’ll still need a few key elements in place. Here’s where to start:
-
Understand the biological need, not just the tech
Fancy algorithms don’t pay bills or deliver value unless tied to real-world applications. Is there an unmet demand in the market, such as faster vaccine development, cheaper enzymes, or synthetic super proteins? Dive into solving real problems, and the resources will follow. -
Forge interdisciplinary teams
A combination of biologists, computational chemists, and machine learning engineers is your winning lineup. Each element of this equation is essential, as bioinformatics doesn’t naturally overlap with generative AI. -
Leverage existing datasets and platforms
Resources like AlphaFold Protein Structures or tools like RFdiffusion are freely accessible. You don’t need to reinvent the wheel, use these as a starting point to accelerate your research. -
Validate early
A beta product or proof-of-concept is critical in fields intersecting with health. Partner with academic labs that can provide experimental validation; it’s often cheaper than funding trials alone.
Mistakes to avoid
Venturing into a field as nascent as protein generation with AI is inherently risky. Here are the common pitfalls:
-
Over-complicating the technology: Potential partners, funders, and even end-users need clear communication. Don’t bury them in jargon; simplify your pitch while letting the science speak.
-
Ignoring ethics and regulation: Synthetic biology has unique ethical concerns. Improper handling (data privacy with human proteins, environmental impacts, or unintended side effects) could sink your credibility.
-
Hiding behind automation: It’s tempting to assume algorithms will handle all the heavy lifting once trained. Remember, AI is only as good as the people guiding it and ensuring responsible use.
Real-world applications: how startups can act
Startups must align their product roadmaps with areas of urgent need. Here are areas primed for de novo protein structure design applications:
-
Personalized medicine
AI-designed proteins and tailored enzymes can target individual patients’ genetics, potentially revolutionizing treatments for cancer or rare genetic disorders. -
Greener manufacturing
A crucial use-case is creating tailored enzymes that catalyze industrial reactions at lower temperatures. This could reduce emissions across essential sectors including detergents, textiles, and biofuels. -
Diagnostic tools
Binding proteins, created through repurposed folding models, can detect specific disease biomarkers for cheaper and more accurate diagnostics, something the post-COVID-19 world will continue to require. -
Environmentally optimized solutions
With tools like PLAID combining 1D sequence and 3D atomic structure generation, we can design proteins that clean up environmental toxins, creating solutions for long-neglected ecological crises.
For companies, creating a robust proof-of-concept in such domains could attract the attention of venture capitalists or strategic partners in pharma and biotech.
Why does this matter for startups?
As someone who champions innovative entrepreneurship, I see a wealth of opportunities for new entrants in this space. What stands out is not just the possibility to create valuable, patent-protected IP but also a chance to integrate these solutions to address critical challenges.
Imagine being the startup known for creating a game-changing enzyme or the company whose AI system sped up the development of vital therapies. The stakes, and rewards, are unimaginable in magnitude.
The challenge is balancing scientific rigor with the pace of startups. Unlike established biopharma companies, which have the luxury of long timelines and big budgets, founders must find a lean pathway to innovation. This is part of why technologies like PLAID and ESMFold’s successors are so groundbreaking. They provide a way to work smarter, not harder, while still aiming for significant biological breakthroughs.
Final thoughts
For those willing to take the leap, this opportunity offers more than profitability, there’s a chance to solve some of humanity's most pressing problems. Focus on creating practical, understandable solutions that address clear market gaps. Bring the right teams together, prepare for the regulatory hurdles, and start small but scalable.
The time to harness AI for protein generation is now. Entrepreneurs in this space stand to pioneer the next chapter of synthetic biology and redefine how we approach life sciences.
By understanding both the business and science implications, entrepreneurs can build companies that don’t just survive but lead in uncharted areas, carving blue oceans in markets that barely existed until now. The race to reimagine protein design has begun. Will you join it?
FAQ
1. What is the significance of combining protein folding models with latent diffusion?
The integration allows for not only predicting protein structures but also designing entirely new ones with specific functionalities, enabling breakthroughs in synthetic biology and drug discovery. Read about protein generation advancements
2. How does latent diffusion help in protein generation?
Latent diffusion models transform random data into biologically relevant protein designs by learning from a compressed protein structure space while effectively reducing experimental costs. Find out more about latent diffusion models for proteins
3. Why is this field becoming relevant now?
The advancements in protein folding predictors like AlphaFold2, along with increasing datasets and a $25 billion market investment in AI-driven drug discovery, have set the stage for rapid innovation in this domain. Discover the global market impact of AI in drug discovery
4. What are the applications of AI-designed proteins?
AI-designed proteins can revolutionize personalized medicine, create enzymes for industrial applications, enable eco-friendly manufacturing, and improve diagnostic tools. Learn more about applications of protein design
5. What tools are currently being used for protein generation?
Technologies like PLAID and models like ESMFold or RFdiffusion are popular for generating protein sequences and 3D atomic structures. Explore PLAID for protein innovation
6. How can startups leverage this innovation?
Startups can save time and costs by using AI-driven tools for in-silico protein design, leveraging existing datasets, and forming interdisciplinary teams for rapid prototyping and validation.
7. What are some potential ethical concerns in synthetic biology?
Unregulated use of synthetic biology tools could lead to misuse, environmental impact, or inadvertently harmful bioengineering outcomes. Clear and strict ethical guidelines are crucial for this field.
8. What does the future hold for protein generation technologies?
The future involves developing more efficient models for large proteins, expanding into related fields like nucleic-acid-protein complexes, and scaling applications in therapeutics, diagnostics, and sustainability. Check out future directions for protein diffusion models
9. What challenges do entrepreneurs in this field face?
Key challenges include mastering the complexity of interdisciplinary knowledge, the cost of R&D, ensuring regulatory compliance, and effectively communicating scientific innovations.
10. Are there any open-source resources available for those interested in this field?
Yes, tools like RFdiffusion and PLAID have open-source repositories enabling researchers and developers to explore protein design systematically. Get started with RFdiffusion | Learn about PLAID on GitHub
About the Author
Violetta Bonenkamp, also known as MeanCEO, is an experienced startup founder with an impressive educational background including an MBA and four other higher education degrees. She has over 20 years of work experience across multiple countries, including 5 years as a solopreneur and serial entrepreneur. Throughout her startup experience she has applied for multiple startup grants at the EU level, in the Netherlands and Malta, and her startups received quite a few of those. She’s been living, studying and working in many countries around the globe and her extensive multicultural experience has influenced her immensely.
Violetta Bonenkamp's expertise in CAD sector, IP protection and blockchain
Violetta Bonenkamp is recognized as a multidisciplinary expert with significant achievements in the CAD sector, intellectual property (IP) protection, and blockchain technology.
CAD Sector:
- Violetta is the CEO and co-founder of CADChain, a deep tech startup focused on developing IP management software specifically for CAD (Computer-Aided Design) data. CADChain addresses the lack of industry standards for CAD data protection and sharing, using innovative technology to secure and manage design data.
- She has led the company since its inception in 2018, overseeing R&D, PR, and business development, and driving the creation of products for platforms such as Autodesk Inventor, Blender, and SolidWorks.
- Her leadership has been instrumental in scaling CADChain from a small team to a significant player in the deeptech space, with a diverse, international team.
IP Protection:
- Violetta has built deep expertise in intellectual property, combining academic training with practical startup experience. She has taken specialized courses in IP from institutions like WIPO and the EU IPO.
- She is known for sharing actionable strategies for startup IP protection, leveraging both legal and technological approaches, and has published guides and content on this topic for the entrepreneurial community.
- Her work at CADChain directly addresses the need for robust IP protection in the engineering and design industries, integrating cybersecurity and compliance measures to safeguard digital assets.
Blockchain:
- Violetta’s entry into the blockchain sector began with the founding of CADChain, which uses blockchain as a core technology for securing and managing CAD data.
- She holds several certifications in blockchain and has participated in major hackathons and policy forums, such as the OECD Global Blockchain Policy Forum.
- Her expertise extends to applying blockchain for IP management, ensuring data integrity, traceability, and secure sharing in the CAD industry.
Violetta is a true multiple specialist who has built expertise in Linguistics, Education, Business Management, Blockchain, Entrepreneurship, Intellectual Property, Game Design, AI, SEO, Digital Marketing, cyber security and zero code automations. Her extensive educational journey includes a Master of Arts in Linguistics and Education, an Advanced Master in Linguistics from Belgium (2006-2007), an MBA from Blekinge Institute of Technology in Sweden (2006-2008), and an Erasmus Mundus joint program European Master of Higher Education from universities in Norway, Finland, and Portugal (2009).
She is the founder of Fe/male Switch, a startup game that encourages women to enter STEM fields, and also leads CADChain, and multiple other projects like the Directory of 1,000 Startup Cities with a proprietary MeanCEO Index that ranks cities for female entrepreneurs. Violetta created the "gamepreneurship" methodology, which forms the scientific basis of her startup game. She also builds a lot of SEO tools for startups. Her achievements include being named one of the top 100 women in Europe by EU Startups in 2022 and being nominated for Impact Person of the year at the Dutch Blockchain Week. She is an author with Sifted and a speaker at different Universities. Recently she published a book on Startup Idea Validation the right way: from zero to first customers and beyond, launched a Directory of 1,500+ websites for startups to list themselves in order to gain traction and build backlinks and is building MELA AI to help local restaurants in Malta get more visibility online.
For the past several years Violetta has been living between the Netherlands and Malta, while also regularly traveling to different destinations around the globe, usually due to her entrepreneurial activities. This has led her to start writing about different locations and amenities from the POV of an entrepreneur. Here’s her recent article about the best hotels in Italy to work from.

