INDEX
Explanations
words related to vigilance and security
references to vigilance and vigilantism
New Auto-Interp
Negative Logits
HF
-0.90
bid
-0.82
UD
-0.75
#$#$
-0.71
fixed
-0.67
emb
-0.67
Hemp
-0.65
MER
-0.65
PART
-0.65
PUT
-0.64
POSITIVE LOGITS
vigilant
1.23
vigilance
0.95
millenn
0.87
igans
0.85
vigilante
0.82
iously
0.81
citiz
0.81
enthusi
0.81
conduc
0.79
ailability
0.77
Activations Density 0.010%