Commit Graph

18 Commits

Author SHA1 Message Date
admin 65537c4174 removed inference limit code 2026-01-21 13:41:03 -06:00
admin b9b07320bc feat: robust 16k/32k context management with request-only tool-call summarization and budget enforcement
- Add normalization of messages before API calls
- Implement token projection and enforce budget for 16k/32k windows
- Summarize only tool-call request arguments (not responses) when over budget
- Optionally elide redundant code blocks in old assistant messages as last-resort trimming
- Default small-model limit to 16k, large to 32k; reserve space for response tokens
- Keep core behavior and tool execution unchanged
2025-08-13 14:25:13 -05:00
admin e504407190 Fix RAG inference 2025-08-07 15:38:01 -05:00
cyclop-bot ff0770f9f4 feat: Implement pre-inference token limit check in openai_compatible_inference_bot.py 2025-06-06 14:25:15 -05:00
admin 0b9e119361 removed trailing slash 2025-06-05 18:06:13 -05:00
cyclop-bot 61fe33e1c4 feat: Add --use-large-model argument to openai_compatible_inference_bot.py
This commit introduces a new command-line argument `--use-large-model` to `openai_compatible_inference_bot.py`. When this argument is provided, the bot will initialize and use the large model (as configured via environment variables) by default, instead of the small model. This allows for easier testing and deployment of the large model from the command line.

Fixes #224
2025-06-05 18:03:18 -05:00
admin 438a940660 Updated filtering 2025-06-03 17:36:26 -05:00
admin 67f83fd3c7 Added better instructions 2025-06-03 17:32:19 -05:00
cyclop-bot 9c9b5991b6 Implement get_bot_status method to report enabled model and prompt path 2025-06-03 14:04:27 -05:00
admin 668d14977a Added return message for model switching 2025-06-03 13:54:38 -05:00
admin f15228fa58 Refactored gemini, openai and claude into one file and removed logic from the base class, also made helper class definable from command line 2025-06-03 13:04:42 -05:00
admin bd0ce3e340 Fixed CI tool and upped tool call limit 2025-06-02 19:35:41 -05:00
admin a0e590f76f Removed max history length from openai calls 2025-06-02 17:13:05 -05:00
cyclop-bot 50c92c2a63 Refactor: Generalize OpenAICompatibleInferenceBot initialization 2025-06-02 16:43:39 -05:00
admin 449ae5d3ba Merge branch 'main' of https://github.com/bucolucas/cyclop 2025-06-02 15:23:22 -05:00
admin 0eed64cbb3 Removed unnecessary awaits 2025-06-02 15:23:20 -05:00
cyclop-bot 9715e3767f Refactor: Remove unnecessary awaits from clear and abort_processing methods 2025-06-02 15:21:06 -05:00
cyclop-bot 56ffb70af0 Create OpenAICompatibleInferenceBot base class. 2025-06-02 14:56:23 -05:00