INDEX
Explanations
proper nouns
references to specific names or terms, particularly those related to "Rig" or similar entities
Negative Logits
Token        Logit
Hurricanes   -0.72
etheless     -0.71
terday       -0.69
Despair      -0.68
Peninsula    -0.66
\">          -0.65
Hots         -0.63
ãĥĻ          -0.63
Faul         -0.63
Reagan       -0.63
Positive Logits
Token        Logit
glers        1.19
rig          1.07
gers         1.05
ging         1.05
idity        1.04
rid          1.03
rett         1.02
endum        1.00
ged          1.00
gin          0.97
Activations Density: 0.020%