mirror of https://github.com/vegu-ai/talemate.git synced 2025-09-02 02:19:12 +00:00

mirror of https://github.com/vegu-ai/talemate lik from https://discordapp.com/channels/1246135513103204483/1267855450221711440

ai game game-development llm roleplay

Find a file

FInalWombat c36fd3a9b0 fixes character descriptions missing from dialogue prompt (#21 ) * character description is no longer part of the sheet and needs to be added separately * prep 0.10.1		2023-10-19 03:05:17 +03:00
docs	Update linux-install.md	2023-10-15 16:09:41 +03:00
scenes	Prep 0.10.0 (#12 )	2023-10-02 01:38:02 +03:00
src/talemate	fixes character descriptions missing from dialogue prompt (#21 )	2023-10-19 03:05:17 +03:00
talemate_frontend	Prep 0.10.0 (#12 )	2023-10-02 01:38:02 +03:00
templates	Prep 0.10.0 (#12 )	2023-10-02 01:38:02 +03:00
.gitignore	Update .gitignore	2023-10-09 12:39:49 +03:00
config.example.yaml	initial commit	2023-09-17 16:46:42 +03:00
install-pytorch-cuda.bat	initial commit	2023-09-17 16:46:42 +03:00
install.bat	initial commit	2023-09-17 16:46:42 +03:00
install.sh	initial commit	2023-09-17 16:46:42 +03:00
LICENSE	initial commit	2023-09-17 16:46:42 +03:00
poetry.lock	Prep 0.10.0 (#12 )	2023-10-02 01:38:02 +03:00
pyproject.toml	fixes character descriptions missing from dialogue prompt (#21 )	2023-10-19 03:05:17 +03:00
README.md	Update README.md	2023-10-18 03:28:30 +03:00
start-backend.bat	initial commit	2023-09-17 16:46:42 +03:00
start.bat	initial commit	2023-09-17 16:46:42 +03:00

README.md

Talemate

Allows you to play roleplay scenarios with large language models.

It does not run any large language models itself but relies on existing APIs. Currently supports text-generation-webui and openai.

This means you need to either have an openai api key or know how to setup oobabooga/text-generation-webui (locally or remotely via gpu renting. --api flag needs to be set)

Current features

responive modern ui
agents
- conversation
- narration
- summarization
- director
- creative
multi-client (agents can be connected to separate APIs)
long term memory (experimental)
- chromadb integration
- passage of time
narrative world state
narrative tools
creative tools
- AI backed character creation with template support (jinja2)
- AI backed scenario creation
runpod integration
overridable templates for all prompts. (jinja2)

Planned features

Kinda making it up as i go along, but i want to lean more into gameplay through AI, keeping track of gamestates, moving away from simply roleplaying towards a more game-ified experience.

In no particular order:

Extension support
- modular agents and clients
Improved world state
Dynamic player choice generation
Better creative tools
- node based scenario / character creation
Improved and consistent long term memory
Improved director agent
- Right now this doesn't really work well on anything but GPT-4 (and even there it's debatable). It tends to steer the story in a way that introduces pacing issues. It needs a model that is creative but also reasons really well i think.
Gameplay loop governed by AI
- objectives
- quests
- win / lose conditions
Automatic1111 client

Quickstart

Installation

Post here if you run into problems during installation.

Windows

Download and install Python 3.10 or higher from the official Python website.
Download and install Node.js from the official Node.js website. This will also install npm.
Download the Talemate project to your local machine. Download from the Releases page.
Unpack the download and run install.bat by double clicking it. This will set up the project on your local machine.
Once the installation is complete, you can start the backend and frontend servers by running start.bat.
Navigate your browser to http://localhost:8080

Linux

python 3.10 or higher is required.

git clone git@github.com:final-wombat/talemate
cd talemate
source install.sh
Start the backend: python src/talemate/server/run.py runserver --host 0.0.0.0 --port 5050.
Open a new terminal, navigate to the talemate_frontend directory, and start the frontend server by running npm run serve.

Configuration

OpenAI

To set your openai api key, open config.yaml in any text editor and uncomment / add

openai:
    api_key: sk-my-api-key-goes-here

You will need to restart the backend for this change to take effect.

RunPod

To set your runpod api key, open config.yaml in any text editor and uncomment / add

runpod:
    api_key: my-api-key-goes-here

You will need to restart the backend for this change to take effect.

Once the api key is set Pods loaded from text-generation-webui templates (or the bloke's runpod llm template) will be autoamtically added to your client list in talemate.

ATTENTION: Talemate is not a suitable for way for you to determine whether your pod is currently running or not. Always check the runpod dashboard to see if your pod is running or not.

Recommended Models

Note: this is my personal opinion while using talemate. If you find a model that works better for you, let me know about it.

Will be updated as i test more models and over time.

Model Name	Type	Notes
Nous Hermes LLama2	13B model	My go-to model for 13B parameters. It's good at roleplay and also smart enough to handle the world state and narrative tools. A 13B model loaded via exllama also allows you run chromadb with the xl instructor embeddings off of a single 4090.
Xwin-LM-13B	13B model	Really strong model, roleplaying capability still needs more testing
MythoMax	13B model	Similar quality to Hermes LLama2, but a bit more creative. Rarely fails on JSON responses.
Synthia v1.2 34B	34B model	Cannot be run at full context together with chromadb instructor models on a single 4090. But a great choice if you're running chromadb with the default embeddings (or on cpu).
Xwin-LM-70B	70B model	Great choice if you have the hardware to run it (or can rent it).
Synthia v1.2 70B	70B model	Great choice if you have the hardware to run it (or can rent it).
GPT-4	Remote	Still the best for consistency and reasoning, but is heavily censored. Talemate will send a general "decensor" system prompt, ymmv. If you do use this make sure to monitor your api usage, talemate tends to send a lot more requests than other roleplaying applications.
GPT-3.5-turbo	Remote	It's really inconsistent with JSON responses, plus its probably still just as heavily censored as GPT-4. If you want to run it i'd suggest running it for the conversation agent, and use GPT-4 for the other agents. If you do use this make sure to monitor your api usage, talemate tends to send a lot more requests than other roleplaying applications.

I have not tested with Llama 1 models in a while, Lazarus was really good at roleplay, but started failing on JSON requirements.

I have not tested with anything below 13B parameters.

Connecting to an LLM

On the right hand side click the "Add Client" button. If there is no button, you may need to toggle the client options by clicking this button:

Text-generation-webui

In the modal if you're planning to connect to text-generation-webui, you can likely leave everything as is and just click Save.

OpenAI

If you want to add an OpenAI client, just change the client type and select the apropriate model.

Ready to go

You will know you are good to go when the client and all the agents have a green dot next to them.

Load the introductory scenario "Infinity Quest"

Generated using talemate creative tools, mostly used for testing / demoing.

You can load it (and any other talemate scenarios or save files) by expanding the "Load" menu in the top left corner and selecting the middle tab. Then simple search for a partial name of the scenario you want to load and click on the result.

Loading character cards

Supports both v1 and v2 chara specs.

Expand the "Load" menu in the top left corner and either click on "Upload a character card" or simply drag and drop a character card file into the same area.

Once a character is uploaded, talemate may actually take a moment because it needs to convert it to a talemate format and will also run additional LLM prompts to generate character attributes and world state.

Make sure you save the scene after the character is loaded as it can then be loaded as normal talemate scenario in the future.

Further documentation

Creative mode (docs WIP)
Prompt template overrides
ChromaDB (long term memory)
Runpod Integration