INDEX
Explanations
references to a specific person named Garnett
proper nouns, specifically names of people and notable figures
New Auto-Interp
Negative Logits
ledged
-0.69
FINE
-0.68
BUG
-0.64
Ober
-0.63
BRE
-0.63
Lobby
-0.62
ħĭ
-0.61
Govern
-0.60
Drone
-0.60
GY
-0.60
POSITIVE LOGITS
erers
1.12
erer
1.02
itional
1.00
omore
0.95
anguage
0.92
ers
0.91
igmat
0.88
anges
0.87
ings
0.87
ership
0.87
Activations Density 0.020%