INDEX
Explanations
mentions of specific dates and events
occurrences of a specific character or symbol
New Auto-Interp
Negative Logits
scattering
-0.68
shack
-0.68
anwhile
-0.64
walking
-0.64
bye
-0.64
reception
-0.63
sitting
-0.62
Plum
-0.62
bleeding
-0.60
fed
-0.60
POSITIVE LOGITS
º
0.99
Ħ¢
0.92
į
0.92
ı
0.89
§
0.88
âĹ¼
0.88
¬
0.86
Ĩ
0.85
»
0.84
£
0.84
Activations Density 0.323%