AI / LLM Engineering

Enterprise AI Search Deployment

An enterprise AI search and assistant platform surfaced across an organization's Workspace data and a curated internal knowledge base, with agentic actions and a nightly automated ingestion pipeline.

4+ (email, calendar, drive, wiki)

Data sources federated

Nightly, fully automated

Ingestion cadence

Zero

Unreviewed integrations left running

The Problem

Employees needed one place to ask natural-language questions and get answers grounded across multiple, previously siloed information sources (email, calendar, drive, internal wiki/intranet), with the ability to take real actions, governed by proper enterprise authentication.

My Approach

Configured and wired up federated data stores across the major productivity-suite sources plus a custom internal knowledge corpus, each scoped to appropriate access controls.
Built an automated, scheduled ETL pipeline that pulls internal wiki content on a nightly cadence, converts it to an indexable format, and publishes it into a "Documents with Metadata" data store — deliberately choosing that indexing mode so search results produce clickable, working citation links back to the source page instead of dead links.
Enabled agentic actions (not just search) on connected productivity sources, using per-user OAuth so actions execute with the requesting employee's own permissions rather than a shared service identity.
Ran the rollout as a scoped pilot (small named group) before wider expansion, with a clear go/no-go checklist and a hard trial-to-paid conversion deadline tracked explicitly.
Piloted a governed, read-only analytics connector so the assistant could answer sales-data questions from a cloud data warehouse — validated it end-to-end against real data, then made the deliberate call to pause and decommission it (revoke the app, firewall the service, retain only the reusable scaffolding) pending a broader security review.
Established a credential hygiene discipline: rotating any secret that had been exposed in a chat/session transcript, and documenting a rotation runbook for the OAuth client used by the platform.

Stack

AI Platform

Google Cloud Gemini Enterprise (Vertex AI Search / Discovery Engine)Federated Workspace connectorsAgentic actions with per-user OAuth

Pipeline

Cloud Run (scheduled job)Cloud SchedulerPythonSecret ManagerCloud Storage

Identity

OAuth 2.0Domain-wide delegationIAM role scoping

Data Integration (piloted)

Model Context Protocol (MCP) connector to a cloud data warehouse

Practices

Phased pilot rolloutCredential rotation hygieneSecurity-first decommissioning of an unreviewed integration

Skills Demonstrated

▸Enterprise search/AI-platform engineering across federated data sources
▸OAuth/identity federation and per-user permission enforcement in an AI context
▸Building reliable, scheduled ETL pipelines for content ingestion
▸Sound security judgment: shipping a pilot but knowing when to pause and gate something on review instead of scaling it
▸Vendor-platform troubleshooting (working through undocumented API behavior methodically rather than guessing)