GPT Computer Assistant

Upsonic/gpt-computer-assistant

5204

490

505

MIT

Editor Fav

Intelligence development framework in python for your product like Apple Intelligence

2024-05-26EDT 16:May:th-14400

2024-10-01EDT 19:Oct:st-14400

. .

Intelligence development framework

|ENGLISH|[简体中文](README.zh_CN.md)|[正體中文](README.zh_TW.md)|[TÜRKÇE](README.TR.md) # GPT Computer Assistant(GCA) Hi, this is an alternative work for providing ChatGPT MacOS app to Windows and Linux. In this way this is a fresh and stable work. You can easily install as Python library for this time but we will prepare a pipeline for providing native install scripts (.exe). Powered by Upsonic Tiger 🐅 A function hub for llm agents. ## 1. Install and run **Python 3.10 or 3.11 is required** ```console pip install 'gpt-computer-assistant[base]' pip install 'gpt-computer-assistant[api]' ``` To run gpt-computer-assistant, simply type ```console computerassistant --api ```

## 2. LLM Settings ```python from gpt_computer_assistant.remote import remote remote.save_models("gpt-4o") remote.save_openai_api_key("sk-**") ```

## 3. Characteristic API ```python # Name of the assitant: remote.change_name("X Intelligence") #Developer personna of the assistant: remote.change_developer("X Company") ```

## 4. Connect Your Functions API ```python # Installing an library: remote.install_library("numpy") # Adding functianility as python functions: @remote.custom_tool def my_server_status() -> bool: """ Check the server status. """ return True ```

## 5. Interact with User API ### remote.input Talk with assistant, about user and computer. With this api you can create an consulting process. ```markdown `Hi, look to user window and return which app using now` `Ask user to is user need any kind of supoprt` `Extract the user redis config file.` ``` With this questions you will make a shortcut for your needs. **You can collect informations from user computer or directly from user or user computer.** ```python output = remote.input("Extract the user redis config file.", screen=False) print(output) ```

. .

## Usage ![options](https://github.com/onuratakan/gpt-computer-assistant/assets/41792982/37d34745-ae4b-4b37-9bfa-aec070c97897) ### Use cases Screenshot 2024-08-13 at 18 33 52

## Roadmap | Feature | Status | Target Release | |---------------------------------|--------------|----------------| | Clear Chat History | Completed | Q2 2024 | | Long Audios Support (Split 20mb) | Completed | Q2 2024 | | Text Inputs | Completed | Q2 2024 | | Just Text Mode (Mute Speech) | Completed | Q2 2024 | | Added profiles (Different Chats) | Completed | Q2 2024 | | More Feedback About Assistant Status | Completed | Q2 2024 | | Local Model Vision and Text (With Ollama, and vision models) | Completed | Q2 2024 | | **Our Customizable Agent Infrastructure** | Completed | Q2 2024 | | Supporting Groq Models | Completed | Q2 2024 | | **Adding Custom Tools** | Completed | Q2 2024 | | Click on something on the screen (text and icon) | Completed | Q2 2024 | | New UI | Completed | Q2 2024 | | Native Applications, exe, dmg | Completed | Q3 2024 | | **Collaborated Speaking Different Voice Models on long responses.** | Completed | Q2 2024 | | **Auto Stop Recording, when you complate talking** | Completed | Q2 2024 | | **Wakeup Word** | Completed | Q2 2024 | | **Continuously Conversations** | Completed | Q2 2024 | | **Adding more capability on device** | Completed | Q2 2024 | | **Local TTS** | Completed | Q3 2024 | | **Local STT** | Completed | Q3 2024 | | Tray Menu | Completed | Q3 2024 | | **Global Hotkey** | On the way | Q3 2024 | | DeepFace Integration (Facial Recognition) | Planned | Q3 2024 | ## Capabilities At this time we have many infrastructure elements. We just aim to provide whole things that already in ChatGPT app. | Capability | Status | |------------------------------------|----------------------------------| | **Local LLM with Vision (Ollama)** | OK | | Local text-to-speech | OK | | Local speech-to-text | OK | | **Screen Read** | OK | | **Click to and Text or Icon in the screen** | OK | | **Move to and Text or Icon in the screen** | OK | | **Typing Something** | OK | | **Pressing to Any Key** | OK | | **Scrolling** | OK | | **Microphone** | OK | | **System Audio** | OK | | **Memory** | OK | | **Open and Close App** | OK | | **Open a URL** | OK | | **Clipboard** | OK | | **Search Engines** | OK | | **Writing and running Python** | OK | | **Writing and running SH** | OK | | **Using your Telegram Account** | OK | | **Knowledge Management** | OK | | **[Add more tool](https://github.com/onuratakan/gpt-computer-assistant/blob/master/gpt_computer_assistant/standard_tools.py)** | ? | ### Predefined Agents If you enable it your assistant will work with these teams: | Team Name | Status | |------------------------------------|----------------------------------| | **search_on_internet_and_report_team** | OK | | **generate_code_with_aim_team_** | OK | | **[Add your own one](https://github.com/onuratakan/gpt-computer-assistant/blob/master/gpt_computer_assistant/teams.py)** | ? |

## Contributors