TASE

TASE

Foreword

TASE (Telegram Audio Search Engine): A lightening-fast audio search engine python library on top of the Telegram messenger platform

GitHub Repo stars PyPI - Downloads

Abstract

TASE allows users to quickly and easily find information that is of genuine interest or value, without the need to wade through numerous irrelevant Telegram channels. It provides users with search results that lead to relevant information on high-quality channels.

It is a growing open source full-text audio search engine platform that serves high-volume requests from users. Based on Python and Telegram, the latest major update introduces many new features among which a highly abstracted and modular design pattern powered by Elasticsearch and ArangoDB with support for parallel clusters on different servers located in different parts of the world.

TASE at a glance

  • Advanced full-text search engine for audio files
  • Extremely fast audio file indexer (benchmark: minimum 4 million songs per day per client)
  • Support for multiple parallel clients as indexer
  • Support for distributed parallel clusters on multiple servers (searching and indexing) (all audio files, graph and document models)
  • Graph of users and items
  • Dynamic URLs
  • Asynchronous
  • Reach admin tools
  • Multilingual
  • Audio file caching
  • Easy configuration and customization
  • Friendly look and feel

Features

  • Search engine
  • direct bot search
  • search from groups and private chats
  • Real-time, Search based on file-name, performer name, audio-name, and lyrics
  • High accuracy and relevance
  • Multilingual search
  • Returns reference
  • Indexing features
  • Optimistic crawling
  • Scoring channel quality
  • banning-proof
  • ID hashing
  • Interactive indexing
  • Fast indexing, User controlling features
  • User interface
  • Playlists
  • Admin features
  • Developer features
  • Documentation
  • Exception safe
  • Extremely fast
  • Monitoring logs
  • Handles RTL langues smoothly

Tech stack and contributions

  • Elasticsearch
  • ArangoDB
  • Pyrogram
  • Python get_text
  • Celery
  • RabbitMQ
  • Redis
  • Pydantic
  • Jinja

Keywords

Search engine, Audio-search-engine, Audio-indexing, Knowledge-graph, Graph, Telegram, python