agent-smith

Spawn endless swarms of GPT4 agents using LangChain to scan for vulnerabilities in your software!

White-hat hacker agents styret af GPT4 (3.5)

Projekt idé (Originalt af Morten)

Brug GPT4 til at automatisere en 'whitehacking' bot. GPT4 udvides til at kunne eksekvere pen-testing. Dvs. at vi bruger GPT4 API'et til at definere kommandoer som vi eksekverer i en terminal, og responset giver vi så til GPT4, som så giver en ny kommando osv. Vi kunne lade os inspirere af https://github.com/Significant-Gravitas/Auto-GPT. Det kunne også være en mulighed at bruge https://js.langchain.com/docs/.

Først og fremmest skal vi lege med OpenAI's GPT API for at se hvad det kan!

Gode udfordringer vi står overfor:

(CLI tool) Hvordan skal værktæjet tage imod target (api url, website etc.) og goal (målet med undersøgelse/forsøg på sikekrhedsbrud)?
(Prompts) Hvilke forskellige prompts (pre-prompts) skal vi bruge?
- Forskellige agents skal kunne forskellige ting (evt. samarbejde?). Det kunne f.eks. agents til hver af OWASPs top 10 Web Application Security Risks OWAPS Top10
- F.eks. Broken Access Control, sql-injection, Misconfiguration, Vuln. and outdated components etc. Port Scanning..
(Command Executor) Hvad gør vi med long-running commands (nmap uden specifik port)?
- Kan vi køre commands sikkert uden af flå vores computer fra hinanden?
Skal skal prøve at bruge LangChains "Agent" til at spawne forskellige GPT agents der kan hjælpe hinanden eller samarbejde om et target. (Tænk: spawn 100 agents der pinger et api med forskellige tools)
Hvordan håndtere vi "known" viden som f.eks. CVE register?
Skal vi gøre det i Python eller JS?
- LangChain JS vs. Python Integrations
- LangChain JS vs. Python Features
Hvordan skal vi lave search i allerede kendt materiale?

Illustration af ide:

flowchart TD
    subgraph White-hat hacker agent 'smith'
    direction TB
    subgraph Open AI API
    G[GPT4]
    end

    subgraph LangChain
    direction RL
    B[Agent A]
    E((Promts))
    E --> |3. Relevant Promt| B
    B --> |2. Goals A| E
    F[CLI tool]
    F --> |1. Enter Goal and Target| B
    B --> |?. search| C[(Indexes:\nCVE register)]
    C --> |?. Known vuln.| B
    D[Command Executor]
    B --> |6. Execute Command A| D
    D --> |7. Command Output A| B
    end

    B -->|4. Task A| G
    G -->|5. Command A| B
    end

BWAPP writeups

https://wooly6bear.files.wordpress.com/2016/01/bwapp-tutorial.pdf

Preprompts

pre_prompt = """Hello, I am a system which reponds with the output of terminal commands. \n You must act as pentester who is allowed to test the security of the system.\n You have access to these tools ONLY: curl, nmap, sqlmap and python interpreter.\n Provide me with one approriate command to gain knowledge about the target. \n The command must be provided in one line, without additional text and all commands should use non-interactive mode and accept all questions. \n I will respond with the output of the command. \n If the response I give you is empty, it might be because the given command should be executed with a specific flag. \n This flag can be found with the command [program] -h or [program] -help or [program] --help. \n An example of such is: curl -v http://someurl.com, -v indicates the response should be verbose (print to terminal) in curl. \n Tools such as sqlmap often requires further interactions by writing "y", "n" in the terminal. Please make sure this is not needed, by adding the appropriate accept all flag.

"""

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
app		app
cves		cves
db		db
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
packages.txt		packages.txt
requirements.txt		requirements.txt
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent-smith

White-hat hacker agents styret af GPT4 (3.5)

BWAPP writeups

Preprompts

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent-smith

White-hat hacker agents styret af GPT4 (3.5)

BWAPP writeups

Preprompts

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages