Features

Turn your documents into an intelligent, always-ready sales assistant.
BABEH combines knowledge base management, structured product data, and AI-powered search into a single platform your team can deploy in minutes.

Knowledge Base Management

Upload any document and make it instantly searchable.

Your files are automatically split into optimized chunks, indexed for full-text search, and embedded into a vector database for semantic understanding.

  • File Upload — Drag and drop PDF or TXT files up to 50 MB. Documents are parsed, chunked, and indexed automatically.
  • URL Ingestion — Paste one or multiple web URLs. BABEH scrapes the main content and indexes it alongside your uploaded files.
  • Smart Chunking — Configure chunk size (100–10,000 characters) and overlap (0–5,000 characters) to match your content type. Technical manuals need different chunking than marketing copy.
  • Document Library — Browse, search, filter by type (file or URL), sort by chunk count, file size, or upload date. Paginate through 10, 25, or 50 records at a time.
  • Edit & Re-index — Update document content at any time. BABEH automatically re-chunks and re-embeds the updated content. Re-index a single document or the entire library in one click.
  • AI Content Improvement — Let the AI review your document chunks and suggest clearer, more complete rewrites. Review the suggestion, accept it, and save — all without leaving the editor.

Pricing Database

Give your AI real-time access to structured pricing data.

A dedicated, schema-driven pricing database that integrates directly into the search pipeline. When a customer asks "how much does a Honda Veloz cost?", the AI pulls live pricing data — not stale document snippets.

  • Full CRUD — Add, edit, and delete pricing records through a clean form interface.
  • Bulk Import — Import from CSV or JSON. Column headers are automatically mapped to your schema fields.
  • Export — Download your entire pricing database as CSV or JSON with schema version metadata included.
  • Full-Text Search — Search across product name, SKU, category, and description with weighted relevance (product name weighted 2× higher than other fields).
  • Smart Price Detection — Natural language price queries are understood automatically: "between 200 million and 300 million" — range search "under 1 billion" — upper-bound filter "budget 500 million" — lower-bound filter "around 300 million" — approximate search with ±20% tolerance
  • VectorDB Sync — Push pricing records into the vector database so they appear in semantic search results alongside document content.

Product Specifications

Structure your product data with 30+ fields including 15 boolean feature flags.

Purpose-built for automotive and product-heavy businesses. Every specification is searchable, filterable, and available to the AI during query time.

  • Comprehensive Schema — Brand, model, variant, year, transmission, engine type, fuel type, drive system, body type, seating capacity, price, and free-text spec notes.
  • 15 Feature Flags — Cooling seat, sunroof, 360 camera, wireless charging, cruise control, collision warning, lane assist, auto parking, Apple CarPlay, Android Auto, keyless entry, push start, rear camera, electric seat, heated seat.
  • Three Import Formats — JSON, CSV, and Excel (.xlsx / .xls). The only data type in BABEH that supports Excel import.
  • Dual Edit Mode — Edit through a structured form or switch to raw JSON for bulk field editing.
  • Full-Text Search — Search across brand, model, variant, transmission, engine type, drive system, body type, and spec notes simultaneously.

Dynamic Schema Editor

Modify your database structure at runtime — no migrations, no downtime.

Add, rename, or remove fields from the pricing database without writing a single line of code. BABEH handles the ALTER TABLE, FTS rebuild, and VectorDB re-sync automatically.

  • Runtime Field Management — Add new columns, rename existing ones, or remove fields you no longer need. Changes take effect immediately.
  • 11 Configurable Properties Per Field — Name, display label, data type (text / number / integer / boolean), required flag, default value, display format (currency / badge / textarea / boolean badge), FTS indexed flag, FTS weight, VectorDB inclusion flag, VectorDB template, and sort order.
  • Automatic Rebuild — When you change FTS or VectorDB settings, BABEH rebuilds the affected indexes in the background. A status indicator shows rebuild progress.
  • Version History — Every schema change is versioned. Browse the full history and view a side-by-side diff between any two versions to see exactly what changed.

Superadmin access only.


Hybrid Search Engine

Three search methods. One unified result.

BABEH doesn't rely on a single search technique. It combines vector similarity, keyword matching, and structured data lookup — then fuses the results with configurable weights.

  • Vector Search — Semantic similarity via ChromaDB. Understands meaning, not just keywords. "affordable family car" matches "budget-friendly minivan" even though they share no words.
  • BM25 Keyword Search — Full-text search via SQLite FTS5. Fast, precise, and excellent for exact product names, SKUs, and model numbers.
  • Hybrid Fusion — Combine vector and BM25 results with configurable weights (default 50/50). Tune the balance to match your use case.
  • Query Decomposition — Multi-topic queries like "compare Honda Veloz vs Toyota Avanza" are automatically split into sub-queries for more diverse, complete retrieval.
  • Automatic Data Source Detection — The engine analyzes each query for pricing keywords (13 terms in Indonesian and English) and specification keywords (25+ terms). When detected, it automatically searches the relevant structured database in addition to the knowledge base.
  • Relevance Threshold — Results below the configured threshold (default 0.25) are filtered out. Score visualization uses color coding: green for strong matches, yellow for moderate, red for weak.
  • Citation System — Every AI response includes numbered source citations. Users can verify the information source for any claim in the response.
  • Streaming Responses — Answers stream in real-time via Server-Sent Events. Users see the response forming word by word instead of waiting for the full answer.

Analytics Dashboard

Understand how your knowledge base is being used.

Five analytics views give you full visibility into queries, performance, token consumption, and user behavior.

  • Summary Dashboard — Total queries, 7-day query volume, active days, average processing time, total token usage, model usage breakdown, and data volume in MB.
  • LLM Usage Metrics — Track input tokens, output tokens, and total tokens over configurable periods (7, 14, 30, 60, or 90 days). Visualized as line charts for token trends and bar charts for daily query volume.
  • Search Insights — Unique search terms, average words per query, top queries ranked by frequency, and an interactive word cloud generated from all search history.
  • User Analytics — Browser distribution, operating system breakdown, device type split (desktop / mobile / tablet), and geographic distribution of users.
  • Query History — Full searchable log of every query with detail view. Filter by date range. Export to CSV for offline analysis.

Embeddable Chat Widget

Add AI-powered search to any website with two lines of code.

A self-contained JavaScript widget that runs inside a Shadow DOM — completely isolated from your site's CSS and JavaScript. No dependencies. No conflicts.

  • Two-Line Installation — Add a script tag and call KBaaSWidget.init(). That's it.
  • Zero Dependencies — No jQuery, no React, no framework required. Works on any website.
  • Shadow DOM Isolation — The widget's styles and scripts are fully encapsulated. Your site's CSS cannot break the widget, and the widget cannot break your site.
  • Programmatic APIsearch(), clear(), destroy(), setTheme(), configure(), getConfig() — full control from your application code.
  • Auto Theme Detection — Automatically matches the user's OS preference (light or dark). Manual override available via API.
  • Responsive Design — Adapts to desktop, tablet, and mobile viewports. Four breakpoints ensure the widget looks right on every screen size.
  • Markdown Rendering — AI responses render with full Markdown support: headers, lists, bold, code blocks, tables.
  • Keyboard ShortcutCtrl+Enter to submit a query.

User & Access Management

Four roles. Granular permissions. Full audit trail.

  • Four Roles — Superadmin (full access), Manager (content + settings), Editor (content only), Analyst (read-only analytics and settings).
  • User Management — Create, edit, deactivate, and delete user accounts. Assign roles at creation or change them later. Superadmin only.
  • Activity Log — Every POST, PUT, and DELETE request is automatically logged with timestamp, user, IP address, HTTP method, endpoint, status code, and user agent. Login passwords are automatically redacted. 120-day retention. Filterable by search term, HTTP method, and date range. Export to CSV.

Configuration & Tuning

Fine-tune every aspect of the AI behavior without touching code.

SettingRangeDefault
LLM ProviderAWS Bedrock / Google GeminiBedrock
Search MethodHybrid / Vector Only / BM25 OnlyHybrid
Vector vs BM25 Weight0–100% each50 / 50
Temperature0.0 – 1.00.3
Relevance Threshold0.0 – 1.00.25
Top-K Results1 – 5015
Chunk Size100 – 10,000 chars500
Chunk Overlap0 – 5,000 chars50
Use LLMOn / OffOn
StreamingOn / OffOn

Developer & Debug Tools

When something doesn't look right, see exactly what the search engine is doing.

  • Search Debugger — Run any query through the full pipeline and inspect all 9 stages: configuration, keyword detection, result summary, LLM context window, threshold filtering, hybrid fusion scores, raw vector results, raw BM25 results, and structured data (pricing + specs) results. Each result shows its relevance score with color-coded bars.
  • VectorDB Viewer — Browse every record in the vector database. View embeddings, metadata, and source documents. Filter by source type. Superadmin only.

Infrastructure

  • SQLite for document metadata, pricing, product specs, settings, and user accounts — no external database server required
  • ChromaDB for vector embeddings — runs embedded, no separate service needed
  • MySQL for query history and activity logs — optional, for high-volume analytics
  • FastAPI backend with async support and Server-Sent Events for streaming
  • Fully idempotent migrations — run migrate_server.py on every deploy, safe to repeat

BABEH is built for teams who need a knowledge-powered AI assistant without the complexity of building one from scratch.
Try it now →

Fitur

Ubah dokumen Anda menjadi asisten penjualan cerdas yang selalu siap melayani.
BABEH menggabungkan manajemen knowledge base, data produk terstruktur, dan pencarian berbasis AI dalam satu platform yang bisa di-deploy oleh tim Anda dalam hitungan menit.

Manajemen Knowledge Base

Unggah dokumen apapun dan langsung bisa dicari.

File Anda secara otomatis dipecah menjadi chunk yang optimal, diindeks untuk pencarian full-text, dan di-embed ke dalam vector database untuk pemahaman semantik.

  • Unggah File — Drag and drop file PDF atau TXT hingga 50 MB. Dokumen diproses, dipecah, dan diindeks secara otomatis.
  • Ingest URL — Tempel satu atau beberapa URL halaman web. BABEH mengekstrak konten utama dan mengindeksnya bersama file yang telah diunggah.
  • Smart Chunking — Atur ukuran chunk (100–10.000 karakter) dan overlap (0–5.000 karakter) sesuai jenis konten Anda. Dokumen teknis butuh pengaturan chunking yang berbeda dari materi marketing.
  • Perpustakaan Dokumen — Telusuri, cari, filter berdasarkan tipe (file atau URL), urutkan berdasarkan jumlah chunk, ukuran file, atau tanggal unggah. Paginasi 10, 25, atau 50 data per halaman.
  • Edit & Re-index — Perbarui konten dokumen kapan saja. BABEH secara otomatis melakukan re-chunk dan re-embed konten yang diperbarui. Re-index satu dokumen atau seluruh perpustakaan dalam satu klik.
  • Perbaikan Konten dengan AI — Biarkan AI meninjau chunk dokumen Anda dan menyarankan penulisan ulang yang lebih jelas dan lengkap. Tinjau saran, terima, dan simpan — semuanya tanpa meninggalkan editor.

Database Harga

Berikan AI Anda akses real-time ke data harga terstruktur.

Database harga yang digerakkan oleh skema, terintegrasi langsung ke dalam pipeline pencarian. Ketika pelanggan bertanya "berapa harga Honda Veloz?", AI mengambil data harga terkini — bukan potongan dokumen yang sudah usang.

  • CRUD Lengkap — Tambah, edit, dan hapus data harga melalui antarmuka form yang bersih.
  • Import Massal — Import dari CSV atau JSON. Header kolom secara otomatis dipetakan ke field skema Anda.
  • Ekspor — Unduh seluruh database harga sebagai CSV atau JSON lengkap dengan metadata versi skema.
  • Pencarian Full-Text — Cari di seluruh nama produk, SKU, kategori, dan deskripsi dengan relevansi berbobot (nama produk diberi bobot 2× lebih tinggi dari field lainnya).
  • Deteksi Harga Cerdas — Query harga dalam bahasa natural dipahami secara otomatis: "antara 200 juta sampai 300 juta" — pencarian rentang "dibawah 1 miliar" — filter batas atas "budget 500 juta" — filter batas bawah "sekitar 300 juta" — pencarian perkiraan dengan toleransi ±20%
  • Sinkronisasi VectorDB — Dorong data harga ke vector database agar muncul di hasil pencarian semantik bersama konten dokumen.

Spesifikasi Produk

Strukturkan data produk Anda dengan 30+ field termasuk 15 flag fitur boolean.

Dirancang khusus untuk bisnis otomotif dan bisnis yang memiliki banyak produk. Setiap spesifikasi bisa dicari, difilter, dan tersedia untuk AI saat proses query.

  • Skema Komprehensif — Brand, model, varian, tahun, transmisi, tipe mesin, jenis bahan bakar, sistem penggerak, tipe bodi, kapasitas penumpang, harga, dan catatan spesifikasi bebas.
  • 15 Flag Fitur — Jok pendingin, sunroof, kamera 360, wireless charging, cruise control, peringatan tabrakan, lane assist, parkir otomatis, Apple CarPlay, Android Auto, keyless entry, push start, kamera belakang, jok elektrik, jok pemanas.
  • Tiga Format Import — JSON, CSV, dan Excel (.xlsx / .xls). Satu-satunya tipe data di BABEH yang mendukung import Excel.
  • Mode Edit Ganda — Edit melalui form terstruktur atau beralih ke JSON mentah untuk pengeditan field secara massal.
  • Pencarian Full-Text — Cari di seluruh brand, model, varian, transmisi, tipe mesin, sistem penggerak, tipe bodi, dan catatan spesifikasi secara bersamaan.

Editor Skema Dinamis

Modifikasi struktur database saat runtime — tanpa migrasi, tanpa downtime.

Tambah, ubah nama, atau hapus field dari database harga tanpa menulis satu baris kode pun. BABEH menangani ALTER TABLE, rebuild FTS, dan re-sync VectorDB secara otomatis.

  • Manajemen Field Runtime — Tambah kolom baru, ubah nama kolom yang ada, atau hapus field yang tidak diperlukan lagi. Perubahan langsung berlaku.
  • 11 Properti yang Dapat Dikonfigurasi Per Field — Nama, label tampilan, tipe data (teks / angka / integer / boolean), flag wajib, nilai default, format tampilan (mata uang / badge / textarea / boolean badge), flag indeks FTS, bobot FTS, flag penyertaan VectorDB, template VectorDB, dan urutan sortir.
  • Rebuild Otomatis — Saat Anda mengubah pengaturan FTS atau VectorDB, BABEH membangun ulang indeks yang terpengaruh di latar belakang. Indikator status menampilkan progres rebuild.
  • Riwayat Versi — Setiap perubahan skema diversikan. Telusuri riwayat lengkap dan lihat perbandingan berdampingan (diff) antara dua versi mana pun.

Hanya akses Superadmin.


Mesin Pencarian Hybrid

Tiga metode pencarian. Satu hasil terpadu.

BABEH tidak bergantung pada satu teknik pencarian. Sistem ini menggabungkan kesamaan vektor, pencocokan kata kunci, dan pencarian data terstruktur — lalu menggabungkan hasilnya dengan bobot yang dapat dikonfigurasi.

  • Pencarian Vektor — Kesamaan semantik melalui ChromaDB. Memahami makna, bukan hanya kata kunci. "mobil keluarga terjangkau" cocok dengan "minivan hemat budget" meskipun tidak berbagi kata yang sama.
  • Pencarian Kata Kunci BM25 — Pencarian full-text melalui SQLite FTS5. Cepat, presisi, dan sangat baik untuk nama produk, SKU, dan nomor model yang tepat.
  • Fusi Hybrid — Gabungkan hasil vektor dan BM25 dengan bobot yang dapat dikonfigurasi (default 50/50).
  • Dekomposisi Query — Query multi-topik seperti "bandingkan Honda Veloz vs Toyota Avanza" secara otomatis dipecah menjadi sub-query untuk pengambilan hasil yang lebih beragam dan lengkap.
  • Deteksi Sumber Data Otomatis — Mesin menganalisis setiap query untuk kata kunci harga (13 istilah) dan kata kunci spesifikasi (25+ istilah). Saat terdeteksi, sistem secara otomatis mencari database terstruktur yang relevan.
  • Ambang Batas Relevansi — Hasil di bawah ambang batas (default 0,25) disaring. Visualisasi skor: hijau untuk kecocokan kuat, kuning untuk sedang, merah untuk lemah.
  • Sistem Sitasi — Setiap respons AI menyertakan sitasi sumber bernomor.
  • Respons Streaming — Jawaban di-stream secara real-time melalui Server-Sent Events.

Dashboard Analitik

Pahami bagaimana knowledge base Anda digunakan.

Lima tampilan analitik memberikan visibilitas penuh terhadap query, performa, konsumsi token, dan perilaku pengguna.

  • Dashboard Ringkasan — Total query, volume query 7 hari, hari aktif, rata-rata waktu pemrosesan, total penggunaan token, breakdown penggunaan model, dan volume data dalam MB.
  • Metrik Penggunaan LLM — Lacak token input, output, dan total selama periode yang dapat dikonfigurasi (7, 14, 30, 60, atau 90 hari).
  • Insight Pencarian — Istilah pencarian unik, rata-rata kata per query, daftar query teratas, dan word cloud interaktif.
  • Analitik Pengguna — Distribusi browser, sistem operasi, tipe perangkat, dan distribusi geografis.
  • Riwayat Query — Log lengkap setiap query. Filter berdasarkan rentang tanggal. Ekspor ke CSV.

Widget Chat Tertanam

Tambahkan pencarian berbasis AI ke website mana pun dengan dua baris kode.

Widget JavaScript mandiri yang berjalan di dalam Shadow DOM — sepenuhnya terisolasi dari CSS dan JavaScript situs Anda. Tanpa dependensi. Tanpa konflik.

  • Instalasi Dua Baris — Tambahkan tag script dan panggil KBaaSWidget.init(). Selesai.
  • Tanpa Dependensi — Tidak perlu jQuery, React, atau framework apapun.
  • Isolasi Shadow DOM — Style dan script widget sepenuhnya terenkapsulasi.
  • API Programatiksearch(), clear(), destroy(), setTheme(), configure(), getConfig().
  • Deteksi Tema Otomatis — Secara otomatis menyesuaikan dengan preferensi OS pengguna (terang atau gelap).
  • Desain Responsif — Empat breakpoint memastikan widget tampil sempurna di setiap ukuran layar.
  • Rendering Markdown — Respons AI dirender dengan dukungan Markdown penuh.
  • Shortcut KeyboardCtrl+Enter untuk mengirim query.

Manajemen Pengguna & Akses

Empat peran. Hak akses granular. Jejak audit lengkap.

  • Empat Peran — Superadmin (akses penuh), Manager (konten + pengaturan), Editor (konten saja), Analyst (analitik dan pengaturan read-only).
  • Manajemen Pengguna — Buat, edit, nonaktifkan, dan hapus akun pengguna. Hanya Superadmin.
  • Log Aktivitas — Setiap request POST, PUT, dan DELETE dicatat otomatis. Password login disunting. Retensi 120 hari. Dapat difilter dan diekspor ke CSV.

Konfigurasi & Tuning

Atur setiap aspek perilaku AI tanpa menyentuh kode.

PengaturanRentangDefault
Provider LLMAWS Bedrock / Google GeminiBedrock
Metode PencarianHybrid / Vektor Saja / BM25 SajaHybrid
Bobot Vektor vs BM250–100% masing-masing50 / 50
Temperatur0,0 – 1,00,3
Ambang Batas Relevansi0,0 – 1,00,25
Hasil Top-K1 – 5015
Ukuran Chunk100 – 10.000 karakter500
Overlap Chunk0 – 5.000 karakter50
Gunakan LLMAktif / NonaktifAktif
StreamingAktif / NonaktifAktif

Alat Developer & Debug

Saat ada yang tidak beres, lihat persis apa yang dilakukan mesin pencarian.

  • Search Debugger — Jalankan query mana pun melalui seluruh pipeline dan inspeksi semua 9 tahapan: konfigurasi, deteksi kata kunci, ringkasan hasil, jendela konteks LLM, penyaringan ambang batas, skor fusi hybrid, hasil vektor mentah, hasil BM25 mentah, dan hasil data terstruktur.
  • VectorDB Viewer — Telusuri setiap record di vector database. Filter berdasarkan tipe sumber. Hanya Superadmin.

Infrastruktur

  • SQLite untuk metadata dokumen, harga, spesifikasi produk, pengaturan, dan akun pengguna — tidak memerlukan server database eksternal
  • ChromaDB untuk vector embedding — berjalan embedded, tidak perlu layanan terpisah
  • MySQL untuk riwayat query dan log aktivitas — opsional, untuk analitik volume tinggi
  • FastAPI backend dengan dukungan async dan Server-Sent Events untuk streaming
  • Migrasi fully idempotent — jalankan migrate_server.py di setiap deploy, aman untuk diulang

BABEH dibangun untuk tim yang membutuhkan asisten AI berbasis knowledge base tanpa kerumitan membangunnya dari nol.
Coba sekarang →