INDEX
Explanations
references to individuals or teams in stressful situations or crises
Negative Logits
itſelf              -0.93
myſelf              -0.87
themſelves          -0.78
Theſe               -0.75
whoſe               -0.74
ある                -0.72
ConverterFactory    -0.71
againſt             -0.71
高い                -0.68
himſelf             -0.68
Positive Logits
+#+#           0.72
tvguidetime    0.63
الحره          0.57
)}_            0.57
'],$           0.55
("")]          0.54
especially     0.54
enschappe      0.54
']==           0.54
included       0.52
Activations Density 0.022%