INDEX
Explanations
specific Arabic and Cyrillic script characters or phrases
New Auto-Interp
Negative Logits
issen
-0.15
Forbidden
-0.14
ðŁ
-0.14
autof
-0.13
ðŁ
-0.13
harbour
-0.13
nab
-0.13
ðŁĴ
-0.13
Privacy
-0.12
asion
-0.12
POSITIVE LOGITS
Disaster
0.33
disaster
0.33
tri
0.31
disasters
0.30
tri
0.28
surviv
0.28
Tri
0.26
business
0.25
catastrophe
0.25
Business
0.25
Activations Density 0.002%