INDEX
Explanations
the character "Ċ"
instances of numerical or date-related information in a news context
New Auto-Interp
Negative Logits
newsp
-0.68
oun
-0.65
newcom
-0.60
simul
-0.58
exha
-0.58
Bridges
-0.57
Princ
-0.54
contemplation
-0.54
Crab
-0.54
councill
-0.54
POSITIVE LOGITS
IPP
0.70
使
0.65
TY
0.63
328
0.63
Islamic
0.62
GER
0.62
Turkish
0.62
African
0.62
Philipp
0.61
BR
0.61
Activations Density 0.088%