Foundation Models & Custom Models

Foundation models are the building blocks of AI in the Dhenara framework. These pre-configured models encapsulate the capabilities of various AI providers and enable consistent interactions regardless of the underlying API differences.

Before proceeding further, please note that, you are not dependent on the Foundation Model in the package. We will update this library as and when new models are published, but you can always create your own FoundationModel objects or even custom AIModel objects and use them in the same manner as described below.

Understanding Foundation Models

In Dhenara, a foundation model is a pre-configured representation of a specific AI model from a provider (like OpenAI, Google AI, Anthropic, etc.) with standardized properties:

Model identification information
Context/token limitations
Cost data for usage tracking
Configuration options
Provider-specific parameters

These foundation models ensure consistent behavior while abstracting away provider-specific implementation details.

Using Foundation Models

You can access foundation models using their respective constants or through the provided APIs:

from dhenara.ai.types.genai.foundation_models import ALL_CHAT_MODELS, ALL_IMAGE_MODELS
from dhenara.ai.types.genai.foundation_models import OPENAI_CHAT_MODELS, ANTHROPIC_CHAT_MODELS
from dhenara.ai.types.genai.foundation_models.fns import FoundationModelFns

# Get all foundation models
for model in ALL_CHAT_MODELS:
    print(f"Model: {model.display_name}, Provider: {model.provider}")

# Get a specific foundation model by name
claude_model = FoundationModelFns.get_foundation_model("claude-3-5-sonnet")

Cross-Provider Model Compatibility

One of the most powerful features of Dhenara's foundation model system is the ability to use the same model across different API providers through model cloning and customization.

Example: Using Claude 3.5 on Amazon Bedrock

Anthropic's Claude models can be accessed through Anthropic's direct API or via Amazon Bedrock. Here's how to adapt a foundation model for a different provider:

from dhenara.ai import AIModelClient
from dhenara.ai.types import AIModelEndpoint
from dhenara.ai.types.external_api import AIModelAPIProviderEnum
from dhenara.ai.types.genai import AIModelAPI, ChatModelCostData
from dhenara.ai.types.genai.foundation_models.anthropic.chat import Claude35Sonnet

# Initialize API configuration for Amazon Bedrock
bedrock_api = AIModelAPI(
    provider=AIModelAPIProviderEnum.AMAZON_BEDROCK,
    credentials={
        "access_key_id": "your_access_key_id",
        "secret_access_key": "your_secret_access_key",
    },
    config={"region": "your_region"},
)

# Clone the foundation model and customize for Amazon Bedrock
bedrock_claude35_sonnet = Claude35Sonnet.clone(
    # Use the Bedrock-specific model name
    model_name="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
)
# NOTE: Remove the version suffix used by Anthropic's direct API
bedrock_claude35_sonnet.metadata["version_suffix"] = None

# Optionally override cost data for accurate usage tracking
bedrock_claude35_sonnet.cost_data = ChatModelCostData(
    input_token_cost_per_million=3.0,
    output_token_cost_per_million=15.0,
)

# Create a model endpoint connecting the model with the API
model_endpoint = AIModelEndpoint(
    api=bedrock_api,
    ai_model=bedrock_claude35_sonnet,
)

# Create the client and use it
client = AIModelClient(
    model_endpoint=model_endpoint,
    is_async=False,
)

Creating Custom Foundation Models

You can create completely custom models for specialized use cases:

from dhenara.ai.types.genai.ai_model import (
    AIModelFunctionalTypeEnum,
    AIModelProviderEnum,
    ChatModelCostData,
    ChatModelSettings,
    FoundationModel,
)

# Create a custom foundation model
custom_model = FoundationModel(
    model_name="your-custom-model",
    display_name="Your Custom Model",
    provider=AIModelProviderEnum.CUSTOM,
    functional_type=AIModelFunctionalTypeEnum.TEXT_GENERATION,
    settings=ChatModelSettings(
        max_context_window_tokens=100000,
        max_output_tokens=8000,
    ),
    valid_options={
        # Define your model's valid options here
    },
    metadata={
        "details": "Your custom model's description",
    },
    cost_data=ChatModelCostData(
        input_token_cost_per_million=1.0,
        output_token_cost_per_million=5.0,
    ),
)

Model Settings and Configuration

Foundation models include detailed settings that specify their capabilities:

Text Generation Models

ChatModelSettings(
    max_context_window_tokens=200000,  # Maximum tokens in the context window
    max_output_tokens=8192,           # Maximum output tokens
    supports_reasoning=True,          # Whether reasoning mode is supported
    max_reasoning_tokens=32000,       # Maximum tokens for reasoning (if supported)
)

Image Generation Models

ImageModelSettings(
    max_words=4000,  # Maximum words in prompt
)

Cost Tracking

Foundation models include cost data for accurate usage tracking:

ChatModelCostData(
    input_token_cost_per_million=3.0,      # Cost per million input tokens
    output_token_cost_per_million=15.0,    # Cost per million output tokens
    cost_multiplier_percentage=10.0,       # Optional markup percentage
)

Readily Available Foundation Models

Dhenara includes a rich collection of pre-configured foundation models. But remember that you can always create your own Models, the functional part is independent from the FoundationModel collection.

Text Generation

GPT-4o
GPT-4o-mini
O1
O1-mini
O3-mini
Gemini 2 Flash
Gemini 2 Flash Lite
Gemini 1.5 Flash
Gemini 1.5 Pro
Claude 3.7 Sonnet
Claude 3.5 Sonnet
Claude 3.5 Haiku
Claude 3 Opus
DeepSeek-R1

Image Generation

DALL-E 2
DALL-E 3
Imagen 3
Imagen 3 Fast

Conclusion

Foundation models in Dhenara provide a powerful abstraction layer that allows you to:

Work with multiple AI providers through a consistent interface
Adapt models to work with different API providers
Track usage and costs accurately
Customize models for specific use cases

By leveraging foundation models, you can create robust AI applications that aren't tied to a specific provider while maintaining full control over model settings and costs.

Understanding Foundation Models​

Using Foundation Models​

Cross-Provider Model Compatibility​

Example: Using Claude 3.5 on Amazon Bedrock​

Creating Custom Foundation Models​

Model Settings and Configuration​

Text Generation Models​

Image Generation Models​

Cost Tracking​

Readily Available Foundation Models​

Text Generation​

Image Generation​

Conclusion​