
Use Parsera LLM Specs API instead of capabilities.rb #132


Closed
wants to merge 27 commits into from

Conversation

crmne
Owner

@crmne crmne commented Apr 22, 2025

- Removed the OpenAI capabilities module and related methods.
- Simplified model parsing for OpenAI and Gemini providers, focusing on essential attributes.
- Introduced a new utility method for deep key symbolization in hashes.
- Updated OpenRouter model parsing to enhance data extraction and structure.
- Deleted obsolete browser and CLI helper tasks to streamline the codebase.
- Consolidated model updater logic into the Rake task for better maintainability.
- Improved error handling and logging throughout the model update process.
@crmne crmne marked this pull request as draft April 22, 2025 10:34
@crmne crmne changed the title Refactor RubyLLM provider capabilities and model parsing to use Parsera Use Parsera LLM Specs API instead of capabilities.rb Apr 22, 2025
module Utils
module_function

def symbolize_keys_deep(hash)


This is like ActiveSupport's deep_symbolize_keys, which I also needed in 2 PRs here and here

You'll want to cover Array child items though, so I recommend just taking my/ActiveSupport's implementation and putting it here, then I'll refactor my PRs to use your Util method instead.

end
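A sketch of the ActiveSupport-style implementation suggested in the comment above, covering Array children as well as nested Hashes (this is an illustrative helper, not necessarily the PR's final code):

```ruby
module Utils
  module_function

  # Recursively convert String keys to Symbols, descending into nested
  # Hashes and Arrays (mirrors ActiveSupport's deep_symbolize_keys).
  def symbolize_keys_deep(value)
    case value
    when Hash
      value.each_with_object({}) do |(k, v), out|
        key = k.respond_to?(:to_sym) ? k.to_sym : k
        out[key] = symbolize_keys_deep(v)
      end
    when Array
      value.map { |v| symbolize_keys_deep(v) }
    else
      value
    end
  end
end
```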

def fetch_parsera_models
  connection = Faraday.new('https://api.parsera.org/v1/llm-specs', request: { timeout: 60 })
  response = connection.get


I don't like the idea of a core dependency on a commercial 3rd party's API, though I very much appreciate the work Parsera is doing by standardizing LLM capabilities into a JSON spec.

I know the code isn't called during any main runtime functions, and I also appreciate the support from Parsera (seems like a great company).

Maybe just move this code to the rake task? Don't feel strongly about this so if you don't want to that's fine.

@jayelkaake

jayelkaake commented Apr 22, 2025

Love this - the initiative by Parsera to standardize capabilities of models is fantastic!

That said, I'm not confident that the LLMs are going to follow the same configs for the same capabilities, so I question whether the capabilities should instead be implemented in the provider classes (where a capability that isn't implemented simply isn't supported).

For example, structured output is supported by Gemini, but its implementation is vastly different from the other LLMs'.

Anyway, still probably good to do this since it's just a better implementation of how the gem already works today.
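One way to express the "implemented means supported" idea from the comment above is to let each provider class own its capability methods and treat absence as unsupported. The class and method names below are purely illustrative, not RubyLLM's actual API:

```ruby
# Hypothetical base provider: a capability counts as supported only if
# the concrete provider class actually implements the matching method.
class Provider
  def supports?(capability)
    # second argument `true` also checks private methods
    respond_to?(:"render_#{capability}", true)
  end
end

class GeminiProvider < Provider
  private

  # Gemini's structured output uses its own generationConfig shape,
  # so the provider owns all implementation details.
  def render_structured_output(schema)
    { generationConfig: { responseMimeType: 'application/json',
                          responseSchema: schema } }
  end
end

class BareProvider < Provider
  # Implements no capabilities, so supports?(:structured_output) is false.
end
```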

crmne and others added 24 commits May 6, 2025 19:00
This PR implements the ability to specify custom dimensions when
generating embeddings, as requested in issue #47.

### What's included
- Added support for passing a dimensions parameter to the embed method
- Implemented dimensions handling in both OpenAI and Gemini providers
- Added tests to verify dimension param works correctly
- Optimized the Gemini provider's `embed` method to reduce unnecessary
API calls when embedding texts, lowering token usage. It now uses the
`batchEmbedContents` endpoint in a single request for both single and
multiple text embeddings.
- Modernized Gemini embeddings following the Dependency Inversion
Principle, as implemented in `openai/embeddings.rb`.
- The Gemini embeddings API response does not contain the
`promptTokenCount` attribute, so I have removed it.
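The batching idea above can be sketched as a helper that builds one `batchEmbedContents` request body whether there is one text or many. The field names (`requests`, `content`, `outputDimensionality`) follow the Gemini API; the helper itself is illustrative:

```ruby
# Build a single batchEmbedContents request body for any number of texts.
# model:      embedding model name, e.g. "text-embedding-004"
# dimensions: optional output dimensionality
def batch_embed_payload(texts, model:, dimensions: nil)
  requests = Array(texts).map do |text|
    request = {
      model: "models/#{model}",
      content: { parts: [{ text: text }] }
    }
    request[:outputDimensionality] = dimensions if dimensions
    request
  end
  { requests: requests }
end
```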

### Implementation notes
I've decided to only implement the per-request dimension configuration
and not the global configuration option that was initially proposed in
the issue. This is because each embedding model has its own default
dimensions, making a global setting potentially confusing.

With this implementation, users can set the embedding dimensions like:
```ruby
embedding = RubyLLM.embed(
  "Ruby is a programmer's best friend",
  model: "text-embedding-3-small",
  dimensions: 512
)
```

### References
- OpenAI API docs:
https://platform.openai.com/docs/api-reference/embeddings
- Gemini API docs: https://ai.google.dev/api/embeddings

Resolves #47

---------

Co-authored-by: Carmine Paolino <[email protected]>
…s_chat

Previously, API errors during a chat turn would leave an orphaned, empty assistant message record. This caused issues, notably with Gemini rejecting subsequent requests containing empty messages.

Fixes #118
…lution in chat, embedding, and image methods

Now `assume_model_exists` can be used in `paint` and `embed` methods too.
The implementation now uses POROs and simplifies the providers.

Fixes #143
…llama provider

- Introduced `RubyLLM::Providers::Ollama::Media` module to manage media content formatting for OpenAI APIs.
- Implemented `format_content` method to process text and attachments, including images, PDFs, and audio.
- Added `format_image` method to convert image attachments into the required format.
…#151)

This updates the acts_as_message, acts_as_chat and acts_as_tool class
methods to use Rails-style foreign keys whenever custom class names are
used as options. For example:

```ruby
class FooMessage < ActiveRecord::Base
  acts_as_message chat_class: 'FooChat', tool_call_class: 'FooToolCall'
end
```

will now set the foreign key on the `belongs_to :chat` association to be
`foo_chat_id`, instead of `chat_id`, and will set the foreign key on
`belongs_to :parent_tool_call` association to `foo_tool_call_id` instead
of just `tool_call_id`.

This is consistent with Rails' naming conventions for class names and
foreign keys. Changes are backwards-compatible with existing
code/behavior, and don't require a major or minor version bump.
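The derivation described above follows the same convention as Rails' `String#foreign_key` (underscore the class name, append `_id`). A dependency-free sketch of that conversion:

```ruby
# Convert a class name like 'FooToolCall' into a Rails-style
# foreign key like 'foo_tool_call_id'.
def rails_foreign_key(class_name)
  class_name
    .gsub(/([a-z\d])([A-Z])/, '\1_\2') # 'FooToolCall' -> 'Foo_Tool_Call'
    .downcase + '_id'
end
```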

Updated test cases to ensure that the associations are working, but
didn't re-record VCR tests, since I don't have an OpenAI API key.

Closes #150

Co-authored-by: Carmine Paolino <[email protected]>
## Purpose

Introduces some basic configuration for the logging capabilities,
allowing users to specify a custom log file and log level. This feature
enhances debugging and monitoring capabilities by providing more control
over where and how logs are recorded.

## Implementation Details

- Added `log_file` configuration option (default: `STDOUT`)
- Added `log_level` configuration option (default: `INFO`, or `DEBUG` if
the `RUBYLLM_DEBUG` env var is set)
- Updated logger initialisation to use the configured log file and level
- Added documentation for logging configuration
- Maintained backward compatibility with existing logging behaviour

## Usage Example

```ruby
# Global configuration
RubyLLM.configure do |config|
  config.log_file = '/logs/ruby_llm.log'  # Custom log file location
  config.log_level = :debug  # Set log level
end
```
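A sketch of how the configured file and level, plus the `RUBYLLM_DEBUG` override, can feed a standard `Logger`. This is illustrative of the behaviour described above, not the gem's exact internals:

```ruby
require 'logger'

# Pick the log device and level from configuration, letting the
# RUBYLLM_DEBUG environment variable force debug-level output.
def build_logger(log_file: $stdout, log_level: nil, env: ENV)
  level =
    if env['RUBYLLM_DEBUG']
      Logger::DEBUG
    elsif log_level
      Logger.const_get(log_level.to_s.upcase) # :debug -> Logger::DEBUG
    else
      Logger::INFO
    end
  Logger.new(log_file, level: level)
end
```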

## Testing

Manual
- Verified logger initialisation with custom log file
- Confirmed log level changes based on configuration
- Tested environment variable override for debug level
- Validated default behaviour (STDOUT logging)

## Documentation

Added a new section in configuration.md for logging settings.

Co-authored-by: Carmine Paolino <[email protected]>
crmne added 2 commits May 6, 2025 19:03
- Removed the OpenAI capabilities module and related methods.
- Simplified model parsing for OpenAI and Gemini providers, focusing on essential attributes.
- Introduced a new utility method for deep key symbolization in hashes.
- Updated OpenRouter model parsing to enhance data extraction and structure.
- Deleted obsolete browser and CLI helper tasks to streamline the codebase.
- Consolidated model updater logic into the Rake task for better maintainability.
- Improved error handling and logging throughout the model update process.
@crmne crmne closed this May 6, 2025