Documentation

Everything you need to integrate permission-aware retrieval into your AI systems.

Quick Start

Get permission-aware retrieval running in under five minutes. Install the SDK, connect your vector database, and execute your first secured retrieval.

Python

pip install gateco

from gateco_sdk import GatecoClient

with GatecoClient("https://api.gateco.ai") as client:
    client.login("admin@company.com", "password")

    # Execute a permission-aware retrieval
    result = client.retrievals.execute(
        connector_id="conn_abc",
        principal_id="user_123",
        query_vector=[0.1, 0.2, 0.3, ...],
        top_k=10,
    )

    for item in result.results:
        print(item.vector_id, item.score)

TypeScript

npm install @gateco/sdk

import { GatecoClient } from "@gateco/sdk";

const client = new GatecoClient({
  baseUrl: "https://api.gateco.ai",
});
await client.login("admin@company.com", "password");

// Execute a permission-aware retrieval
const result = await client.retrievals.execute({
  connectorId: "conn_abc",
  principalId: "user_123",
  queryVector: [0.1, 0.2, 0.3],
  topK: 10,
});

for (const item of result.results) {
  console.log(item.vectorId, item.score);
}
client.close();

CLI

pip install gateco

# Login
gateco login --email admin@company.com --password secret

# Test your connector
gateco connectors test conn_abc

# Execute a retrieval
gateco retrieve --connector-id conn_abc \
  --principal-id user_123 \
  --vector-file embedding.json --top-k 10

Core Concepts

Gateco is a permission-aware retrieval layer, the security middleware between your AI agents and vector databases. It enforces access policies at retrieval time using a deny-by-default model.

Connectors

Connections to your vector databases. Gateco supports 12 connectors: pgvector, Pinecone, Qdrant, Weaviate, Milvus, Chroma, OpenSearch, Supabase, Neon, Azure AI Search, Vertex AI Vector Search, and Vertex AI Search. Tier 1 connectors (pgvector, Supabase, Neon, Pinecone, Qdrant) support full ingestion workflows.

Policies

Rules that determine who can access what data. Gateco supports RBAC (role-based), ABAC (attribute-based), and REBAC (relationship-based) policies. Policies have a lifecycle: Draft → Active → Archived. Only active policies are enforced.

Secured Retrievals

Permission-checked queries against your vector database. Every retrieval evaluates all active policies against the requesting principal and target resources. Outcomes are: Allowed (all results pass), Partial (some filtered), or Denied (no access).

Principals

The identities requesting access: users, service accounts, or AI agents. Principals are synced from your identity providers (Azure Entra ID, AWS IAM, Okta, GCP) or managed directly.

Gated Resources

Your vector data with security metadata attached: classification (public/internal/confidential/restricted), sensitivity (low/medium/high/critical), domain, and ownership. Resources are bound to connectors.

Readiness Levels (L0-L4)

A semantic measure of your security posture per connector. L0: Not connected. L1: Connection verified. L2: Search operational. L3: Resource-level policies active. L4: Chunk-level enforcement (highest granularity). See our blog post on semantic readiness levels for a deep dive.

Search Modes

Gateco supports four search modes on every retrieval. The mode determines how the underlying vector store is queried before policy enforcement runs. All four modes return the same policy-filtered result envelope. keyword, hybrid, and vector modes produce a ranked list with scores; grep mode returns exact matches with a match_count and sort_order.

Build vs. Buy

The most common question we hear: "Our engineers could build this." You're right. They could. Here's what they'd be maintaining:

In-house retrieval governance surface area

•Policy DSL parser and evaluator (RBAC + ABAC + REBAC condition language)
•12 connector adapters (each vector DB has a different SDK, auth method, and query API)
•Audit trail schema that's stable enough for compliance review
•Metadata resolution hierarchy (inline → sql_view → sidecar fallback)
•Identity provider sync adapters (Okta, Azure Entra, AWS IAM, GCP Cloud Identity)
•Classification suggestion engine
•SCIM v2 provisioning endpoint

The bottom line

That's 6–12 months of engineering time for a v1, and a maintenance burden that grows with every new connector or policy type added. Gateco Team at $499/mo replaces roughly 0.5 engineering FTE focused on retrieval governance. We've done that work. You ship product features.

Connectors

Connect Gateco to your vector database. Each connector manages connection credentials, search configuration, and metadata resolution. See our blog post on metadata resolution modes for a detailed comparison of sidecar, inline, and SQL view strategies.

Create a Connector

# Python
connector = client.connectors.create(
    name="Production Pinecone",
    type="pinecone",
    config={
        "api_key": "pk-...",
        "environment": "us-east-1",
        "index_name": "knowledge-base",
    },
)

# Test the connection
result = client.connectors.test(connector.id)
print(result.status, result.latency_ms)

Supported Connectors

Connector	Type	Tier	Ingestion Support
pgvector	pgvector	Tier 1	Full
Supabase	supabase	Tier 1	Full
Neon	neon	Tier 1	Full
Pinecone	pinecone	Tier 1	Full
Qdrant	qdrant	Tier 1	Full
OpenSearch	opensearch	Tier 2	Retroactive only
Weaviate	weaviate	Tier 2	Retroactive only
Milvus	milvus	Tier 2	Retroactive only
Chroma	chroma	Tier 2	Retroactive only
Azure AI Search	azure_ai_search	Tier 2	Retroactive only
Vertex AI Vector Search	vertex_ai_vector_search	Tier 2	Search only
Vertex AI Search	vertex_ai_search	Tier 2	Retroactive + hybrid search

Metadata Resolution Modes

Gateco resolves policy-relevant metadata via a configurable hierarchy per connector:

•sidecar (default): Metadata managed in Gateco's own registry
•inline: Metadata extracted from vector payloads (requires metadata_field_mapping)
•sql_view: Metadata read from a Postgres view (Postgres-family connectors only)
•auto: Tries inline → sql_view → sidecar in order

Policies

Define who can access what data through RBAC and ABAC policies. Policies are evaluated at every retrieval in a deny-by-default model.

Create a Policy

# Python — RBAC policy with proper condition format
policy = client.policies.create(
    name="Engineering Data Access",
    type="rbac",
    effect="allow",
    description="Allow engineering team to access internal docs",
    resource_selectors=["connector_abc"],
    rules=[{
        "effect": "allow",
        "priority": 1,
        "description": "Engineers can access internal docs",
        "conditions": [
            # resource. prefix = check resource metadata
            {"field": "resource.classification", "operator": "lte", "value": "internal"},
            # principal. prefix = check requesting identity
            {"field": "principal.roles", "operator": "contains", "value": "engineer"},
        ],
    }],
)

# Activate the policy
client.policies.activate(policy.id)

Policy Lifecycle

Policies go through three states:

•Draft: Created but not enforced. Safe to edit and test.
•Active: Enforced on all retrievals. Use the Access Simulator (Growth+) to test before activating.
•Archived: Retained for audit history but no longer enforced.

Policy Condition Fields

Condition fields MUST be prefixed with resource. or principal. Bare field names silently resolve against the principal, not the resource.

•Resource fields: resource.classification, resource.sensitivity, resource.domain, resource.labels, resource.encryption_mode
•Principal fields: principal.roles, principal.groups, principal.attributes.*
•Operators: eq, ne, in, contains, lte (ordered level comparison), gte (ordered level comparison)
•WARNING: Bare field names (e.g., "classification" without prefix) check principal attributes, not resource metadata
•Deny policy gotcha: when a deny policy's selectors match but no rules match, the policy-level effect=deny fires. Add a catch-all allow rule to deny only specific conditions.

Ingestion

Ingest documents with security metadata attached. Available for Tier 1 connectors (pgvector, Supabase, Neon, Pinecone, Qdrant).

Single Document

response = client.ingest.document(
    connector_id="conn_abc",
    external_resource_id="doc-quarterly-report",
    text="Q4 2025 financial results...",
    classification="confidential",
    sensitivity="high",
    domain="finance",
    labels=["quarterly", "financial"],
    owner_principal_id="user_cfo",
)
print(f"Ingested {response.chunk_count} chunks")

Batch Ingestion

response = client.ingest.batch(
    connector_id="conn_abc",
    records=[
        {
            "external_resource_id": "doc-001",
            "text": "First document content...",
            "classification": "internal",
        },
        {
            "external_resource_id": "doc-002",
            "text": "Second document content...",
            "classification": "public",
        },
    ],
)
print(f"Ingested {response.total_chunks} total chunks")

CLI Ingestion

# Single file
gateco ingest report.txt --connector-id conn_abc \
  --classification confidential --sensitivity high

# Batch directory
gateco ingest-batch ./documents --connector-id conn_abc \
  --glob "*.md"

Classification Suggestions

Automatically scan your connector resources and get rule-based classification suggestions. Review suggestions, then apply the ones you approve.

Suggest & Apply

# Scan resources and get suggestions
suggestions = client.connectors.suggest_classifications(
    connector_id="conn_abc",
    scan_limit=1000,
    sample_size=10,
)

for s in suggestions.suggestions:
    print(f"{s.resource_key}: {s.suggested_classification} "
          f"({s.confidence:.0%}) — {s.reasoning}")

# Apply approved suggestions
result = client.connectors.apply_suggestions(
    connector_id="conn_abc",
    suggestions=suggestions.suggestions,
)
print(f"Applied {result.applied}, created {result.resources_created} resources")

Classification Levels

Classification	Sensitivity Levels	Example
Public	Low	Blog posts, FAQs, product docs
Internal	Medium	Wiki pages, onboarding materials
Confidential	High	HR records, employee data
Restricted	Critical	Financial reports, legal contracts

Access Simulator

Dry-run policy evaluation to test what a principal can access before going live. Available on Growth and Enterprise plans.

Run a Simulation

# Python
result = client.simulator.run(
    principal_id="user_123",
    connector_id="conn_abc",
)

print(f"Matched: {result.matched_count}")
print(f"Allowed: {result.allowed_count}")
print(f"Denied: {result.denied_count}")

for trace in result.traces:
    print(f"  {trace.resource_id}: {trace.decision} — {trace.reason}")

Audit Trail

Every operation is logged with 50+ audit event types covering retrievals, policy changes, connector operations, ingestion, and more. Export as CSV/JSON (Growth+) or stream to your SIEM (Enterprise). Read our blog post on building compliance-ready AI systems for best practices.

Query Audit Events

# List recent retrieval events
page = client.audit.list(
    event_types="retrieval_allowed,retrieval_denied",
    per_page=50,
)

for event in page.items:
    print(f"{event.event_type} by {event.actor} at {event.created_at}")

# Export (Growth tier)
export = client.audit.export_csv(
    date_from="2025-01-01",
    date_to="2025-03-31",
    format="csv",
)

Event Types

Audit events are grouped into categories:

•User: login, logout, settings_changed
•Connector: added, updated, tested, removed, sync events
•Policy: created, updated, activated, archived, deleted
•Retrieval: allowed, denied (with full policy trace)
•Data: metadata_bound, document_ingested, batch_ingested
•Identity Provider: added, updated, removed, synced
•Pipeline: created, updated, run, error

API Reference

The Gateco REST API provides 50+ endpoints across 17 route modules. All endpoints require JWT authentication (obtained via login) or a static API key. Requests and responses use JSON.

Authentication

# Login to get JWT tokens
curl -X POST https://api.gateco.ai/api/auth/login \
  -H "Content-Type: application/json" \
  -d '{"email":"admin@company.com","password":"secret"}'

# Use the access token
curl https://api.gateco.ai/api/connectors \
  -H "Authorization: Bearer <access_token>"

Key Endpoints

Method	Path	Description
POST	/api/auth/login	Authenticate and get tokens
GET	/api/connectors	List connectors
POST	/api/connectors	Create connector
POST	/api/connectors/:id/test	Test connectivity
POST	/api/connectors/:id/bind	Bind metadata to resources
POST	/api/connectors/:id/suggest-classifications	Get classification suggestions
POST	/api/connectors/:id/apply-suggestions	Apply approved suggestions
GET	/api/policies	List policies
POST	/api/policies	Create policy
POST	/api/policies/:id/activate	Activate a policy
POST	/api/retrievals/execute	Execute secured retrieval
GET	/api/retrievals	List past retrievals
POST	/api/v1/ingest	Ingest single document
POST	/api/ingest/batch	Batch ingest documents
POST	/api/simulator/run	Run access simulation (Growth+)
GET	/api/audit-log	List audit events
POST	/api/audit-log/export	Export audit log (Growth+)
GET	/api/identity-providers	List identity providers
POST	/api/retroactive/register	Register existing vectors

Search Modes

Pass search_mode to retrievals.execute() (and answers.execute()) to control how the underlying vector store is queried. Policy enforcement runs after the search result set is assembled, regardless of mode.

•vector (default): Dense embedding similarity search. Pass query_vector. Best for semantic questions.
•keyword: Sparse BM25-style keyword search. Pass query as a plain string. Best for exact terminology lookups.
•hybrid: Blends vector and keyword scores. Pass query and alpha (1.0 = all-vector, 0.0 = all-keyword). Best general-purpose mode.
•grep: Literal substring / regex scan. Pass query as the pattern. Returns match_count and sort_order on the result. Best for log IDs, error codes, or exact strings.

# Keyword search — exact terminology
result = client.retrievals.execute(
    connector_id="conn_abc",
    principal_id="user_123",
    query="quarterly revenue report",
    search_mode="keyword",
    top_k=10,
)

# Hybrid search — blend vector + keyword (alpha controls the mix)
result = client.retrievals.execute(
    connector_id="conn_abc",
    principal_id="user_123",
    query="quarterly revenue report",
    search_mode="hybrid",
    alpha=0.5,  # 1.0=all-vector, 0.0=all-keyword
    top_k=10,
)

# Grep search — literal pattern / regex scan
result = client.retrievals.execute(
    connector_id="conn_abc",
    principal_id="user_123",
    query="ERR-4021",
    search_mode="grep",
)
print(result.match_count, result.sort_order)

# Answer synthesis also accepts search_mode
answer = client.answers.execute(
    query="What's our Q4 revenue?",
    connector_id="conn_abc",
    principal_id="user_123",
    search_mode="hybrid",
    alpha=0.5,
)

Rate Limits & Plan Limits

Resource	Free	Team	Growth	Enterprise
Secured retrievals/mo	1,000	50,000	500,000	Unlimited
Connectors	1	3	10	Unlimited
Identity providers	1	1	5	Unlimited
Policies	3	Unlimited	Unlimited	Unlimited
Team members (incl. pending invites)	1	5	25	Unlimited

Identity Providers

Sync principals from your existing identity infrastructure. Gateco supports four identity provider types with scheduled sync and SCIM provisioning.

Supported Providers

Provider	Type Value	Auth Config	SCIM Support
Azure Entra ID	azure_entra_id	Tenant ID + Client Secret	Yes (Enterprise app)
Okta	okta	Domain + SSWS API Token	Yes (SCIM app integration)
AWS IAM Identity Center	aws_iam	Region + Access Key	Sync only (no native SCIM)
GCP Cloud Identity	gcp	Service Account JSON	Sync only (no native SCIM)

Configure a Provider

# Python
idp = client.identity_providers.create(
    name="Company Okta",
    type="okta",
    config={
        "domain": "company.okta.com",
        "api_token": "...",
    },
)

# Sync principals from the provider
client.identity_providers.sync(idp.id)

Auto-Sync Configuration

Enable scheduled sync to keep principals up to date automatically. Set a sync interval (15, 30, or 60 minutes) on any connected IDP. Gateco uses PostgreSQL advisory locks for multi-instance coordination, so only one instance runs sync per IDP per interval. Disable the scheduler with DISABLE_SYNC_SCHEDULER=true.

# Update IDP with sync interval (via API)
client.identity_providers.update(
    idp.id,
    sync_config={"sync_interval_minutes": 30},
)

SCIM Provisioning (Enterprise)

Enterprise plans support inbound SCIM v2 provisioning for real-time user and group sync. Generate a SCIM bearer token, configure your IDP to push changes to the Gateco SCIM endpoint, and users/groups are provisioned automatically. SCIM tokens are one-per-IDP: generating a new token revokes the previous one.

# Generate a SCIM token (plaintext returned once)
token = client._request(
    "POST",
    f"/api/identity-providers/{idp.id}/scim-token",
)
# Configure your IDP with:
#   Endpoint: https://api.gateco.ai/api/scim/v2
#   Bearer Token: token["token"]

SCIM Endpoints

Method	Endpoint	Description
POST	/api/scim/v2/Users	Create a user (maps to Principal)
GET	/api/scim/v2/Users/{id}	Get user by ID
GET	/api/scim/v2/Users	List users (ListResponse)
PATCH	/api/scim/v2/Users/{id}	Update user attributes
DELETE	/api/scim/v2/Users/{id}	Deactivate user (soft delete)
POST	/api/scim/v2/Groups	Create a group
GET	/api/scim/v2/Groups/{id}	Get group with members
GET	/api/scim/v2/Groups	List groups
PATCH	/api/scim/v2/Groups/{id}	Update group (add/remove members)
DELETE	/api/scim/v2/Groups/{id}	Delete group (hard delete)

Deep dives

Standalone guides for specific integration patterns and features.

Production deployment guide

Reach L3 governance in two weeks: Postgres or Pinecone with Okta or Entra ID. Step-by-step from connector setup to first secured retrieval.

MCP server

Connect Claude Desktop, Cursor, and other MCP hosts to Gateco. Install, credential setup, and all six tool parameters.

Agent identity patterns

Three patterns for integrating AI agents with Gateco: bot-as-principal, delegated employee identity, and external user authorization.

Need help?

Check out the Python SDK and TypeScript SDK on GitHub for source code and examples, or contact us for support.

Building with an AI coding assistant? llms-full.txt provides LLM-friendly context for integration.

Get started free