INDEX
Explanations
the name "George"
references to the name "George" in various contexts
New Auto-Interp
Negative Logits
igslist
-0.79
endant
-0.76
odcast
-0.74
ateur
-0.72
ngth
-0.72
letal
-0.71
ansas
-0.71
aditional
-0.70
pees
-0.69
Repe
-0.68
POSITIVE LOGITS
Orwell
0.88
Thor
0.83
Zimmerman
0.83
Lucas
0.80
Cic
0.78
VI
0.76
iannopoulos
0.75
Soros
0.74
ORGE
0.73
Clo
0.72
Activations Density 0.014%