INDEX
Explanations
words with special characters like "Âĵ" or "ÃŃ"
instances of the character "ĵ" in the text
New Auto-Interp
Negative Logits
apse
-0.74
ignment
-0.74
aries
-0.70
enburg
-0.69
ouched
-0.68
atos
-0.68
ako
-0.65
arial
-0.64
hattan
-0.63
uminati
-0.63
POSITIVE LOGITS
âĸĵ
0.91
DIT
0.87
¡
0.75
uses
0.74
iod
0.73
BLE
0.73
Vote
0.72
CEPT
0.72
USE
0.71
ĵ
0.71
Activations Density 0.013%