AGI Strategy - Day 1
Updated Day 1 for Your 3-Week Plan Link to heading
Here’s your revised Day 1 that incorporates Codeberg setup and a progress tracking system:
Day 1 (Monday): Environment Setup, Codeberg Configuration & Paper Reading Link to heading
Time: 3-4 hours
Tasks: Link to heading
Part A: Codeberg Account Setup (45-60 min) Link to heading
-
Create Codeberg Account
- Go to codeberg.org and register
- Choose a username you’re comfortable using professionally
- Verify your email address
-
Set up SSH Authentication (recommended over HTTPS)
# Generate SSH key if you don't have one ssh-keygen -t ed25519 -C "your_email@example.com" # View your public key cat ~/.ssh/id_ed25519.pub # Copy this key and add it in Codeberg: Settings → SSH/PGN Keys → Add Key -
Configure Git locally
git config --global user.name "Your Name" git config --global user.email "your_email@example.com" -
Create your project repository
- Create a new repository on Codeberg named
toxigen-mitigation(or similar) - Initialize with a README
- Clone locally:
git clone git@codeberg.org:YOUR_USERNAME/toxigen-mitigation.git cd toxigen-mitigation - Create a new repository on Codeberg named
-
Set up repository structure
mkdir -p data scripts results figures logs touch .gitignore # Add common Python gitignore entries
Part B: Progress Tracking System (30 min) Link to heading
Recommendation: Store your daily log in your Codeberg repository. This keeps everything in one place, provides version history, and creates a public record of your work. Here’s a structure:
Create PROGRESS.md in your repo root:
# Progress Log: ToxiGen Mitigation Project
## Overview
- **Goal:** Evaluate safety prompts on ToxiGen benchmark
- **Timeline:** 3 weeks (15 working days)
- **Model:** Llama 3.1 8B (local)
- **Started:** [DATE]
---
## Day 1 - [DATE]
### Plan
- [ ] Create Codeberg account and set up SSH
- [ ] Install Python dependencies
- [ ] Install Ollama and download Llama 3.1 8B
- [ ] Verify PyTorch MPS
- [ ] Read ToxiGen paper
### What I Did
*Fill in as you work...*
### Got Stuck On
*Document blockers here...*
### Solutions/Workarounds
*How you resolved issues...*
### Tomorrow's Carryover
*Anything to continue next day...*
### Time Spent
- Codeberg setup: X hours
- Environment: X hours
- Paper reading: X hours
- **Total:** X hours
---
## Day 2 - [DATE]
### Plan
- [ ] Load ToxiGen dataset
- [ ] Explore structure
- [ ] Select subset for evaluation
...
*Continue template for each day*
Why Codeberg for progress tracking:
- Single source of truth for your project
- Git history shows your progress over time
- Can reference specific commits when issues arise
- Public visibility creates accountability
- Easy to include code snippets in markdown
Part C: Python Environment Setup (30-45 min) Link to heading
-
Create virtual environment
python -m venv venv source venv/bin/activate # On macOS -
Install dependencies
pip install torch transformers datasets detoxify pandas matplotlib seaborn jupyter -
Install Ollama and download Llama 3.1 8B
# Download Ollama from ollama.ai # Then: ollama pull llama3.1:8b -
Verify PyTorch MPS (Metal) is working
import torch print(f"MPS available: {torch.backends.mps.is_available()}") print(f"MPS built: {torch.backends.mps.is_built()}")
Part D: Paper Reading (60-90 min) Link to heading
- Read the ToxiGen paper (Hartvigsen et al., ACL 2022)
- Focus on: methodology, evaluation sections, benchmark construction
- Take notes in your
PROGRESS.mdfile
Deliverables: Link to heading
- ✅ Codeberg account created and configured
- ✅ Repository cloned with proper structure
- ✅
PROGRESS.mdfile created with Day 1 entry - ✅ Working Python environment
- ✅ Llama 3.1 running locally via Ollama
- ✅ Notes on ToxiGen approach documented
Code Checkpoint: Link to heading
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from detoxify import Detoxify
# Verify MPS
print(f"MPS available: {torch.backends.mps.is_available()}")
print(f"MPS built: {torch.backends.mps.is_built()}")
# Test toxicity detector
toxicity_model = Detoxify('original')
result = toxicity_model.predict("This is a test sentence")
print(result)
# Test Ollama connection
import requests
response = requests.post('http://localhost:11434/api/generate',
json={'model': 'llama3.1:8b', 'prompt': 'Hello', 'stream': False})
print(response.json())
End of Day Git Commit: Link to heading
git add .
git commit -m "Day 1: Environment setup, Codeberg config, initial structure"
git push origin main
Summary of Changes to Day 1 Link to heading
| Original | Updated |
|---|---|
| No Codeberg mention | Full Codeberg setup with SSH |
| No progress tracking | PROGRESS.md template in repo |
| 2-3 hours | 3-4 hours (more realistic) |
| Basic deliverables | Includes repo structure and tracking system |
The progress tracking template includes:
- Plan: Daily checkboxes for tasks
- What I Did: Narrative of actual work
- Got Stuck On: Blockers and challenges
- Solutions: How you resolved them
- Carryover: Items for next day
- Time Spent: Actual time tracking
This gives you a reproducible work log that will be invaluable when writing up your results in Week 3!