INDEX
Explanations
references to specific locations or events with the word 'Rig' in them
references to the word "rig."
Negative Logits
Token        Logit
Leilan       -0.91
etheless     -0.71
Pry          -0.65
FTA          -0.64
Virgin       -0.64
birth        -0.61
sands        -0.60
Nadu         -0.60
Taiwanese    -0.60
peak         -0.60
Positive Logits
Token        Logit
uez          1.28
rig          1.11
gers         1.06
iosity       1.05
gling        1.04
orously      1.02
ging         0.99
uers         0.97
gment        0.93
gered        0.92
Activations Density: 0.011%