Explanations
symbols and special characters, particularly the character 'è¡'
non-standard characters or symbols in the text
Negative Logits
ittal      -0.81
ategory    -0.76
ivia       -0.74
htaking    -0.69
itol       -0.68
orthy      -0.68
izabeth    -0.68
ounce      -0.67
kees       -0.66
strous     -0.66
Positive Logits
Coffee     0.76
76561      0.72
Dial       0.71
coffee     0.71
Fire       0.69
Rew        0.67
Champ      0.67
åĤ         0.62
RAW        0.61
1945       0.60
Activations Density 0.000%