INDEX
Explanations
phrases related to changes or transitions
symbols or special characters in text
New Auto-Interp
Negative Logits
lers
-0.82
tons
-0.78
compet
-0.69
manship
-0.63
creen
-0.63
transfer
-0.62
transfers
-0.61
constitution
-0.60
etter
-0.59
coats
-0.59
POSITIVE LOGITS
ł
1.34
ª
1.34
¡
1.17
¤
1.14
Ĵ
1.14
ı
1.13
ĸ
1.12
ij
1.12
IJ
1.11
¹
1.09
Activations Density 0.096%