INDEX
Explanations
mentions of specific organizations or substances
specific terms and names related to geopolitical issues, military groups, and scientific concepts
New Auto-Interp
Negative Logits
VID
-0.73
ahime
-0.66
falls
-0.64
eah
-0.63
href
-0.63
Kers
-0.62
age
-0.62
Wilson
-0.62
Kear
-0.60
rising
-0.59
POSITIVE LOGITS
®,
0.82
Saharan
0.76
®
0.75
âĦ¢
0.72
èĢħ
0.70
roid
0.68
xual
0.65
ervative
0.64
acher
0.64
tein
0.63
Activations Density 0.294%