INDEX
Explanations
references to threats
New Auto-Interp
Negative Logits
Pab
-0.50
Kingston
-0.49
Pab
-0.48
ậu
-0.44
Nana
-0.43
*)
-0.43
cob
-0.43
bgColor
-0.41
pab
-0.41
Alva
-0.41
POSITIVE LOGITS
threat
2.14
threat
1.95
Threat
1.94
Threat
1.91
threats
1.77
Threats
1.61
Threats
1.61
amenaza
1.53
threaten
1.50
threatening
1.43
Activations Density 0.005%