INDEX
Explanations
vocabulary related to various threats mentioned in a given context
references to various threats, particularly in a geopolitical context
Negative Logits
Token     Logit
ricks     -0.79
urses     -0.75
arist     -0.73
uesday    -0.70
alt       -0.68
Band      -0.68
uties     -0.68
ashion    -0.67
ools      -0.67
mys       -0.66
Positive Logits
Token        Logit
posed        1.05
threat       0.98
threat       0.89
Threat       0.87
threats      0.87
crow         0.81
deterrent    0.81
glare        0.79
xual         0.75
detection    0.74
Activations Density: 0.037%