INDEX
Explanations
proper nouns
occurrences of the character "Ļ"
New Auto-Interp
Negative Logits
disadvant
-0.70
sacrific
-0.70
smugglers
-0.69
leaps
-0.66
seiz
-0.64
stride
-0.64
jog
-0.63
chunks
-0.61
dracon
-0.60
bun
-0.60
POSITIVE LOGITS
ï¸ı
1.10
tre
0.91
owners
0.86
VICE
0.83
worthiness
0.81
bernatorial
0.79
via
0.78
§
0.77
ship
0.76
Balt
0.75
Activations Density 0.302%