Building AI With MongoDB: Integrating Vector Search And Cohere to Build Frontier Enterprise Apps

Mat Keep
April 25, 2024
#genAI

Cohere is the leading enterprise AI platform, building large language models (LLMs) which help businesses unlock the potential of their data. Operating at the frontier of AI, Cohere’s models provide a more intuitive way for users to retrieve, summarize, and generate complex information.

Cohere offers both text generation and embedding models to its customers. Enterprises running mission-critical AI workloads select Cohere because its models offer the best performance-cost tradeoff and can be deployed in production at scale. Cohere’s platform is cloud-agnostic: its models are accessible through Cohere’s own API as well as popular managed cloud services, and they can be deployed in a virtual private cloud (VPC) or even on-premises to meet companies where their data is, offering the highest levels of flexibility and control.

Cohere’s leading Embed 3 and Rerank 3 models can be used with MongoDB Atlas Vector Search to convert MongoDB data to vectors and build a state-of-the-art semantic search system. Search results can also be passed to Cohere’s Command R family of models for retrieval-augmented generation (RAG) with citations.

Check out our AI resource page to learn more about building AI-powered apps with MongoDB.

A new approach to vector embeddings

It is in the realm of embeddings that Cohere has made a host of recent advances. Described as “AI for language understanding,” Embed is Cohere’s leading text representation language model. Cohere offers both English and multilingual embedding models, and gives users the ability to specify the type of data they are computing an embedding for (e.g., search document, search query). The result is embeddings that improve the accuracy of search results for traditional enterprise search or retrieval-augmented generation.
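As a minimal sketch of how the data type is specified, the two helpers below pass a different `input_type` for stored documents and for user queries. They assume `client` is a `cohere.Client` instance from the Cohere Python SDK whose `embed` response exposes an `embeddings` list; the model name and the helper names are illustrative choices, not something prescribed by the tutorial.

```python
from typing import Any, List

# Assumed model name for illustration; a multilingual variant could be used instead.
EMBED_MODEL = "embed-english-v3.0"


def embed_documents(client: Any, texts: List[str]) -> List[List[float]]:
    """Embed text that will be stored in the database and searched later."""
    response = client.embed(
        texts=texts,
        model=EMBED_MODEL,
        input_type="search_document",  # marks the input as content to be indexed
    )
    return response.embeddings


def embed_query(client: Any, query: str) -> List[float]:
    """Embed a user's search query for comparison against stored document vectors."""
    response = client.embed(
        texts=[query],
        model=EMBED_MODEL,
        input_type="search_query",  # marks the input as a search query
    )
    return response.embeddings[0]


# Hypothetical usage (requires the cohere package and an API key):
#   import cohere
#   co = cohere.Client("YOUR_API_KEY")
#   doc_vectors = embed_documents(co, ["MongoDB Atlas is a developer data platform."])
#   query_vector = embed_query(co, "What is MongoDB Atlas?")
```

Keeping the two calls in separate helpers makes it harder to embed a query with the document setting by mistake, which would quietly reduce search accuracy.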

One challenge developers faced using Embed was that documents had to be passed one by one to the model endpoint, limiting throughput when dealing with larger data sets. To address that challenge and improve developer experience, Cohere has recently announced its new Embed Jobs endpoint. Now entire data sets can be passed in one operation to the model, and embedded outputs can be more easily ingested back into your storage systems.

Additionally, with only a few lines of code, Rerank 3 can be added at the final stage of search systems to improve accuracy. It also works across 100+ languages and offers uniquely high accuracy on complex data such as JSON, code, and tabular data. This is particularly useful for developers who rely on legacy dense retrieval systems.
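To illustrate where reranking sits in the pipeline, here is a hedged sketch that takes hits already returned by a keyword or vector search and reorders them by relevance. It assumes `client` is a `cohere.Client` whose `rerank` response carries `results` with `index` and `relevance_score` attributes; the model name, the `text` field, and the `rerank_score` key are illustrative assumptions.

```python
from typing import Any, Dict, List

# Assumed model name for illustration.
RERANK_MODEL = "rerank-english-v3.0"


def rerank_hits(
    client: Any,
    query: str,
    hits: List[Dict[str, Any]],
    text_field: str = "text",
    top_n: int = 3,
) -> List[Dict[str, Any]]:
    """Reorder first-stage search hits by the relevance score the reranker assigns."""
    if not hits:
        return []

    documents = [str(hit[text_field]) for hit in hits]
    response = client.rerank(
        query=query,
        documents=documents,
        model=RERANK_MODEL,
        top_n=min(top_n, len(hits)),
    )

    reranked = []
    for result in response.results:
        # result.index points back into the list of documents that was sent.
        hit = dict(hits[result.index])  # copy so the caller's hits are not mutated
        hit["rerank_score"] = result.relevance_score
        reranked.append(hit)
    return reranked
```

Because the function only needs a list of dictionaries with a text field, the same step can follow either a full-text query or a vector search query.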

To demonstrate how developers can take advantage of the new Embed Jobs endpoint, we have published the tutorial “How to use Cohere embeddings and rerank modules with MongoDB Atlas.” Readers will learn how to store, index, and search the embeddings from Cohere. They will also learn how to use the Cohere Rerank model to provide a powerful semantic boost to the quality of keyword and vector search results.

**Figure 1:** Illustrating the embedding generation and search workflow shown in the tutorial

Why MongoDB Atlas and Cohere?

MongoDB Atlas provides a proven OLTP database handling high read and write throughput backed by transactional guarantees. Pairing these capabilities with Cohere’s batch embeddings is massively valuable to developers building sophisticated gen AI apps. Developers can be confident that Atlas Vector Search will handle high-scale vector ingestion, making embeddings immediately available for accurate and reliable semantic search and RAG. To speed up experimentation, developers and data scientists can configure separate vector search indexes side by side and compare the performance of different parameters used in the creation of vector embeddings.
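One way to picture the side-by-side comparison is to keep two embedding fields on the same documents and define one vector search index per field. The sketch below only builds the index definitions as plain dictionaries in the Atlas Vector Search format; the field names, index names, and the choice of 1024 and 384 dimensions (matching Cohere's standard and light Embed v3 models) are assumptions for illustration.

```python
from typing import Any, Dict, List, Optional

SUPPORTED_SIMILARITIES = {"cosine", "dotProduct", "euclidean"}


def vector_index_definition(
    path: str,
    num_dimensions: int,
    similarity: str = "cosine",
    filter_paths: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build an Atlas Vector Search index definition for one embedding field."""
    if similarity not in SUPPORTED_SIMILARITIES:
        raise ValueError(f"unsupported similarity: {similarity}")

    fields: List[Dict[str, Any]] = [
        {
            "type": "vector",
            "path": path,
            "numDimensions": num_dimensions,
            "similarity": similarity,
        }
    ]
    # Metadata fields must be declared as filter fields to be usable for pre-filtering.
    for filter_path in filter_paths or []:
        fields.append({"type": "filter", "path": filter_path})
    return {"fields": fields}


# Two hypothetical indexes over the same collection, one per embedding variant.
candidate_indexes = {
    "idx_embed_v3": vector_index_definition("embedding_v3", 1024),
    "idx_embed_v3_light": vector_index_definition("embedding_v3_light", 384),
}
```

Each definition can then be supplied when creating a vector search index, for example through the Atlas UI, and the same test queries run against both indexes to compare result quality.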

In addition to batch embeddings, Atlas Triggers can also be used to embed new or updated source content in real time, as illustrated in the Cohere workflow shown in Figure 2.

**Figure 2:** MongoDB Atlas Vector Search supports Cohere’s batch and real-time workflows. (Image courtesy of Cohere)

Supporting both batch and real-time embeddings from Cohere makes MongoDB Atlas well suited to highly dynamic gen AI-powered apps that need to be grounded in live, operational data. Developers can use MongoDB’s expressive query API to pre-filter vector searches with query predicates on metadata, making it much faster to access and retrieve the most relevant vector embeddings. The unification and synchronization of source application data, metadata, and vector embeddings in a single platform, accessed by a single API, makes building gen AI apps faster, with lower cost and complexity. Those apps can be layered on top of the secure, resilient, and mature MongoDB Atlas developer data platform that is used today by over 45,000 customers spanning startups to enterprises and governments handling mission-critical workloads.
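As a rough sketch of metadata pre-filtering, the function below builds an aggregation pipeline whose `$vectorSearch` stage carries an optional `filter` predicate. The index name, field names, and projected fields are hypothetical, and any field used in the filter is assumed to be declared as a filter field in the vector search index.

```python
from typing import Any, Dict, List, Optional, Sequence


def vector_search_pipeline(
    query_vector: Sequence[float],
    index_name: str,
    path: str,
    metadata_filter: Optional[Dict[str, Any]] = None,
    limit: int = 5,
    num_candidates: int = 100,
) -> List[Dict[str, Any]]:
    """Build an aggregation pipeline that runs a vector search with an optional pre-filter."""
    if num_candidates < limit:
        raise ValueError("num_candidates must be at least as large as limit")

    stage: Dict[str, Any] = {
        "index": index_name,
        "path": path,
        "queryVector": list(query_vector),
        "numCandidates": num_candidates,
        "limit": limit,
    }
    if metadata_filter:
        # The predicate narrows the candidate set before vectors are compared.
        stage["filter"] = metadata_filter

    return [
        {"$vectorSearch": stage},
        # Hypothetical output fields; the score is exposed through $meta.
        {"$project": {"_id": 0, "title": 1, "text": 1, "score": {"$meta": "vectorSearchScore"}}},
    ]


# Hypothetical usage with a PyMongo collection and a query embedding:
#   pipeline = vector_search_pipeline(
#       query_vector, "vector_index", "embedding",
#       metadata_filter={"category": {"$eq": "news"}},
#   )
#   results = list(collection.aggregate(pipeline))
```

Since the pipeline is an ordinary list of stages, further stages such as `$lookup` or `$match` can be appended to combine the search results with other operational data.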

What's next?

To start your journey into gen AI and Atlas Vector Search, review our 10-minute Learning Byte. In the video, you’ll learn about use cases, benefits, and how to get started using Atlas Vector Search.

