A bridge between Fllama and LangChain for Dart. This package enables on-device inference with any `.gguf` model and allows you to create powerful pipelines using LangChain. Fllama is built on `llama.cpp`, bringing its capabilities to Flutter applications.
- Run inference on-device with any `.gguf` model
- Use `ChatFllama` for chat-based models
- Tool calling support
- Pass tool call output back to the model
- Manual control for model loading/unloading
- Support for image inputs
- Output formatting support (e.g. JSON)
Add the following to your `pubspec.yaml`:

```yaml
dependencies:
  langchain_fllama:
    git:
      url: https://github.com/breitburg/langchain_fllama
      ref: main
```
Note: This package cannot be published to pub.dev as it relies on native library bindings.
First, obtain a `.gguf` model. You can find compatible models on the Hugging Face Hub. For a complete list of supported models, refer to the `llama.cpp` README.
You can load models in several ways:
- Download at runtime and save to the device (e.g., in the cache directory using `path_provider`)
- Let users select models from their device using `file_picker`
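For example, a model can be downloaded and cached on-device with `path_provider`, or selected by the user with `file_picker`. A minimal sketch (the download URL and file name are placeholders):

```dart
import 'dart:io';

import 'package:file_picker/file_picker.dart';
import 'package:path_provider/path_provider.dart';

/// Downloads a .gguf model (placeholder URL) into the app's cache directory
/// and returns its path, reusing the file if it was downloaded before.
Future<String> downloadModel(String url, String fileName) async {
  final cacheDir = await getTemporaryDirectory();
  final file = File('${cacheDir.path}/$fileName');
  if (!await file.exists()) {
    final request = await HttpClient().getUrl(Uri.parse(url));
    final response = await request.close();
    await response.pipe(file.openWrite());
  }
  return file.path;
}

/// Lets the user pick a .gguf file from their device and returns its path,
/// or null if the dialog was cancelled.
Future<String?> pickModel() async {
  final result = await FilePicker.platform.pickFiles();
  return result?.files.single.path;
}
```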
Once you have a model path, you can run a one-off prompt with `invoke`:

```dart
final modelPath = 'path/to/model.gguf';

final chat = ChatFllama(
  defaultOptions: ChatFllamaOptions(model: modelPath),
);

final prompt = PromptValue.string('What is the capital of France?');
final response = await chat.invoke(prompt);

print(response.outputAsString);
// Output: Paris
```
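Because `ChatFllama` is a regular LangChain chat model, it can also be composed into chains. A minimal sketch, assuming the standard LangChain.dart runnable API (`ChatPromptTemplate`, `pipe`, `StringOutputParser`):

```dart
final chat = ChatFllama(
  defaultOptions: ChatFllamaOptions(model: modelPath),
);

// Prompt template -> model -> plain-text output parser.
final promptTemplate = ChatPromptTemplate.fromTemplate(
  'Translate the following text to {language}: {text}',
);
final chain =
    promptTemplate.pipe(chat).pipe(StringOutputParser<ChatResult>());

final answer = await chain.invoke({
  'language': 'French',
  'text': 'I love programming.',
});
print(answer);
```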
For multi-turn conversations, pass the chat history as a `ChatPromptValue` and stream the response:

```dart
final modelPath = 'path/to/model.gguf';

final chat = ChatFllama(
  defaultOptions: ChatFllamaOptions(model: modelPath),
);

final prompt = ChatPromptValue([
  HumanChatMessage(
    content: ChatMessageContent.text('Remember the number 5123.'),
  ),
  AIChatMessage(
    content: 'Sure, I will remember the number.',
  ),
  HumanChatMessage(
    content: ChatMessageContent.text('What is the number?'),
  ),
]);

await for (final part in chat.stream(prompt)) {
  print(part.outputAsString);
}
// Output: The number is 5123.
```
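Each streamed part can be concatenated to rebuild the full response (assuming, as with other LangChain.dart chat models, that every part contains only the newly generated chunk):

```dart
// Accumulate streamed chunks into a single string.
final buffer = StringBuffer();
await for (final part in chat.stream(prompt)) {
  buffer.write(part.outputAsString);
}
print(buffer.toString());
```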
To use tool calling, declare a `ToolSpec` and pass it to `ChatFllamaOptions`:

```dart
final modelPath = 'path/to/model.gguf';

const weatherTool = ToolSpec(
  name: 'get_current_weather',
  description: 'Get the current weather in a given location',
  inputJsonSchema: {
    'type': 'object',
    'properties': {
      'location': {
        'type': 'string',
        'description': 'The city and state, e.g. San Francisco, CA',
      },
    },
    'required': ['location'],
  },
);

final chat = ChatFllama(
  defaultOptions: ChatFllamaOptions(
    model: modelPath,
    tools: const [weatherTool],
  ),
);

final prompt = PromptValue.string('What\'s the weather in Leuven, Belgium?');
final response = await chat.invoke(prompt);

for (final call in response.output.toolCalls) {
  print('Tool call ${call.name} in ${call.arguments['location']}');
}
// Output: Tool call get_current_weather in Leuven, Belgium
```
Note: While streaming with tools is supported, it's recommended to use the `invoke` method when implementing tool calls. The tool call JSON will initially appear as regular text in the output stream until the model completes generating the JSON.
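To pass a tool's output back to the model (one of the features listed above), run the tool yourself, then append the assistant's tool-call message and a tool message with the result before invoking the model again. A minimal sketch, assuming LangChain.dart's `ChatMessage.tool` constructor; `fetchWeather` is a hypothetical helper:

```dart
import 'dart:convert';

// `chat` and `weatherTool` are set up as in the example above.
final messages = <ChatMessage>[
  ChatMessage.humanText('What\'s the weather in Leuven, Belgium?'),
];

final response = await chat.invoke(PromptValue.chat(messages));
final toolCall = response.output.toolCalls.first;

// Execute the tool ourselves (fetchWeather is a hypothetical helper).
final result = await fetchWeather(toolCall.arguments['location'] as String);

// Feed the assistant's tool call and the tool result back to the model.
messages
  ..add(response.output)
  ..add(ChatMessage.tool(
    toolCallId: toolCall.id,
    content: json.encode(result),
  ));

final followUp = await chat.invoke(PromptValue.chat(messages));
print(followUp.outputAsString);
```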
Contributions are welcome! Please follow the Conventional Commits specification when creating pull requests.
This project is licensed under the MIT License. See the LICENSE file for details.
- Telosnex - Creator of Fllama
- ggerganov - Creator of `llama.cpp`
- davidmigloz - Creator of LangChain for Dart