Handling Massive PDF files with Mamba and ModernBERT: A Model ComparisonIn today’s data-driven environment, vast amounts of information remain locked within lengthy PDF documents — such as research papers…Mar 23Mar 23
RAG-powered AI: Querying Clinical Notes with an Intelligent Chat AssistantIn the rapidly growing healthcare data landscape, quickly and accurately retrieving relevant clinical insights presents a significant…Mar 15Mar 15
BigQuery and Postgres as Vector DatabasesWith the explosion of Large Language Models (LLMs) and their applications in Retrieval-Augmented Generation (RAG), the demand for efficient…Mar 7Mar 7
Multi-model Clinical AI agentAt their core, AI agents are systems designed to perform tasks, make decisions, and interact with their environment autonomously, often…Jan 18Jan 18
Trigger GitHub’s “Repository Dispatch” Event From ServiceNowConfluent allows users to perform various resource deployment and access provisioning tasks manually from the online portal. Our priority…Sep 27, 2024Sep 27, 2024
Confluent Cloud Kafka Metrics API and BigQuery to track cluster usageMonitoring the usage of clusters on the Confluent Cloud Kafka platform is crucial from a FinOps perspective. It’s not just about tracking…May 17, 2024May 17, 2024
Data Governance for Topics in Confluent Cloud KafkaWhen it comes to managing topics in Confluent Cloud Kafka, one aspect stands out for its crucial role in ensuring the accuracy…May 5, 2024May 5, 2024
Self-service capability for requesting BigQuery dataset creation — part2In part1, I provided details about the self-service capability I built for the creation of BigQuery datasets through Internal Developer…Dec 27, 2023Dec 27, 2023
Self-service capability for requesting BigQuery dataset creation — part1In my previous blog posts, I have explained the scale of the data platform my team owns and manages on GCP. We manage almost all GCP data…Dec 4, 2023Dec 4, 2023