AGI Strategy - Day 1

Updated Day 1 for Your 3-Week Plan Link to heading

Here’s your revised Day 1 that incorporates Codeberg setup and a progress tracking system:

Day 1 (Monday): Environment Setup, Codeberg Configuration & Paper Reading Link to heading

Time: 3-4 hours

Tasks: Link to heading

Part A: Codeberg Account Setup (45-60 min) Link to heading

Create Codeberg Account
- Go to codeberg.org and register
- Choose a username you’re comfortable using professionally
- Verify your email address

Set up SSH Authentication (recommended over HTTPS)

# Generate SSH key if you don't have one
ssh-keygen -t ed25519 -C "your_email@example.com"

# View your public key
cat ~/.ssh/id_ed25519.pub

# Copy this key and add it in Codeberg: Settings → SSH/PGN Keys → Add Key

Configure Git locally

git config --global user.name "Your Name"
git config --global user.email "your_email@example.com"

Create your project repository
- Create a new repository on Codeberg named toxigen-mitigation (or similar)
- Initialize with a README
- Clone locally:
```
git clone git@codeberg.org:YOUR_USERNAME/toxigen-mitigation.git
cd toxigen-mitigation
```

Set up repository structure

mkdir -p data scripts results figures logs
touch .gitignore
# Add common Python gitignore entries

Part B: Progress Tracking System (30 min) Link to heading

Recommendation: Store your daily log in your Codeberg repository. This keeps everything in one place, provides version history, and creates a public record of your work. Here’s a structure:

Create PROGRESS.md in your repo root:

# Progress Log: ToxiGen Mitigation Project

## Overview
- **Goal:** Evaluate safety prompts on ToxiGen benchmark
- **Timeline:** 3 weeks (15 working days)
- **Model:** Llama 3.1 8B (local)
- **Started:** [DATE]

---

## Day 1 - [DATE]

### Plan
- [ ] Create Codeberg account and set up SSH
- [ ] Install Python dependencies
- [ ] Install Ollama and download Llama 3.1 8B
- [ ] Verify PyTorch MPS
- [ ] Read ToxiGen paper

### What I Did
*Fill in as you work...*

### Got Stuck On
*Document blockers here...*

### Solutions/Workarounds
*How you resolved issues...*

### Tomorrow's Carryover
*Anything to continue next day...*

### Time Spent
- Codeberg setup: X hours
- Environment: X hours
- Paper reading: X hours
- **Total:** X hours

---

## Day 2 - [DATE]

### Plan
- [ ] Load ToxiGen dataset
- [ ] Explore structure
- [ ] Select subset for evaluation
...

*Continue template for each day*

Why Codeberg for progress tracking:

Single source of truth for your project
Git history shows your progress over time
Can reference specific commits when issues arise
Public visibility creates accountability
Easy to include code snippets in markdown

Part C: Python Environment Setup (30-45 min) Link to heading

Create virtual environment

python -m venv venv
source venv/bin/activate  # On macOS

Install dependencies

pip install torch transformers datasets detoxify pandas matplotlib seaborn jupyter

Install Ollama and download Llama 3.1 8B

# Download Ollama from ollama.ai
# Then:
ollama pull llama3.1:8b

Verify PyTorch MPS (Metal) is working

import torch
print(f"MPS available: {torch.backends.mps.is_available()}")
print(f"MPS built: {torch.backends.mps.is_built()}")

Part D: Paper Reading (60-90 min) Link to heading

Read the ToxiGen paper (Hartvigsen et al., ACL 2022)
Focus on: methodology, evaluation sections, benchmark construction
Take notes in your PROGRESS.md file

Deliverables: Link to heading

✅ Codeberg account created and configured
✅ Repository cloned with proper structure
✅ PROGRESS.md file created with Day 1 entry
✅ Working Python environment
✅ Llama 3.1 running locally via Ollama
✅ Notes on ToxiGen approach documented

Code Checkpoint: Link to heading

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from detoxify import Detoxify

# Verify MPS
print(f"MPS available: {torch.backends.mps.is_available()}")
print(f"MPS built: {torch.backends.mps.is_built()}")

# Test toxicity detector
toxicity_model = Detoxify('original')
result = toxicity_model.predict("This is a test sentence")
print(result)

# Test Ollama connection
import requests
response = requests.post('http://localhost:11434/api/generate', 
    json={'model': 'llama3.1:8b', 'prompt': 'Hello', 'stream': False})
print(response.json())

End of Day Git Commit: Link to heading

git add .
git commit -m "Day 1: Environment setup, Codeberg config, initial structure"
git push origin main

Summary of Changes to Day 1 Link to heading

Original	Updated
No Codeberg mention	Full Codeberg setup with SSH
No progress tracking	`PROGRESS.md` template in repo
2-3 hours	3-4 hours (more realistic)
Basic deliverables	Includes repo structure and tracking system

The progress tracking template includes:

Plan: Daily checkboxes for tasks
What I Did: Narrative of actual work
Got Stuck On: Blockers and challenges
Solutions: How you resolved them
Carryover: Items for next day
Time Spent: Actual time tracking

This gives you a reproducible work log that will be invaluable when writing up your results in Week 3!