INDEX
Explanations
news related to military and defense activities, as well as mentions of specific locations and names
Negative Logits
advertisement   -0.65
berth           -0.58
ufact           -0.57
interrupted     -0.57
tesy            -0.57
Grayson         -0.56
ASIC            -0.56
takeoff         -0.56
obscurity       -0.54
mouth           -0.54
Positive Logits
士              0.81
Ó               0.73
imaru           0.73
ovi             0.72
opic            0.71
ope             0.71
owski           0.69
etry            0.68
а               0.66
ivas            0.66
Activations Density: 0.175%