INDEX
Explanations
phrases that start with special characters followed by letters or numbers
specific special characters or symbols
Negative Logits
Dupl      -0.73
creen     -0.72
wagen     -0.71
Bris      -0.70
Jeanne    -0.69
conduc    -0.68
destro    -0.68
Farn      -0.66
WithNo    -0.66
sters     -0.65
Positive Logits
ª         1.20
Ĵ         1.09
IJ         1.08
ł         1.04
ı         1.01
¤         1.00
¹         0.98
Ĥ         0.95
ħ         0.93
ij         0.93
Activations Density 0.106%
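
The density figure above is presumably the fraction of token positions on which this feature fires. The page itself does not define it, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (number of active positions) / (total positions).
    The dashboard does not state its exact definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, of which exactly one is active.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.106% would mean the feature is active on roughly one token in a thousand.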