Explanations
specific diacritical marks and special characters in text
Negative Logits
hot              -0.62
ban              -0.56
Revenir          -0.56
(blank)          -0.55
anti             -0.53
zuführen         -0.53
styleType        -0.53
del              -0.53
data             -0.52
pan              -0.52
Positive Logits
itſelf           1.02
Houſe            0.83
thâu             0.83
himſelf          0.81
themſelves       0.79
neceſſ           0.79
InjectAttribute  0.79
myſelf           0.78
feroit           0.77
ainfi            0.77
Activations Density 0.451%