INDEX
Explanations
special characters and non-English text
the special character 'Ħ' and related symbols or sequences
Negative Logits
geries        -0.94
distracting   -0.84
distracted    -0.74
foreground    -0.71
responders    -0.69
Catalyst      -0.66
offending     -0.66
explan        -0.65
regul         -0.65
answ          -0.65
Positive Logits
и      1.04
ski    0.92
à¸     0.91
æŃ¦    0.90
о      0.90
ij     0.89
а      0.88
Ħ      0.87
APH    0.86
ï¸     0.86
Activations Density 0.007%
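
Several of the positive-logit tokens (à¸, æŃ¦, ï¸, Ħ) look like the printable stand-ins that byte-level BPE tokenizers use for raw bytes, which would fit the "special characters and non-English text" explanation. The dashboard does not name the tokenizer, so the sketch below assumes the GPT-2 style byte-to-character table; the helper names `bytes_to_unicode` and `token_to_bytes` are illustrative and not part of the dashboard. Tokens that are already shown decoded (such as the Cyrillic и) are not in that table and raise a `KeyError`.

```python
def bytes_to_unicode():
    """Byte -> printable-character table in the GPT-2 byte-level BPE style (assumed)."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to code point 256 + offset.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: display character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Map a byte-level token string back to the raw bytes it stands for."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token)


if __name__ == "__main__":
    for tok in ["æŃ¦", "à¸", "ï¸", "Ħ"]:
        raw = token_to_bytes(tok)
        # Partial UTF-8 sequences cannot be decoded alone, so show a replacement mark.
        print(tok, raw.hex(), raw.decode("utf-8", errors="replace"))
```

Under this assumption, æŃ¦ maps to the bytes e6 ad a6, which is the complete UTF-8 encoding of 武, while à¸ (e0 b8) and ï¸ (ef b8) are the leading bytes of longer sequences and Ħ (84) is a lone continuation byte.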