INDEX
Explanations
phrases related to security and official authority
terms related to security and secularism
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.69
Silk
-0.67
Gladiator
-0.67
balls
-0.66
Siberian
-0.62
irrit
-0.61
ãĥ£
-0.60
bron
-0.60
Burnett
-0.59
Legion
-0.59
POSITIVE LOGITS
recy
1.57
rets
1.54
urities
1.40
ular
1.34
aucus
1.25
ession
1.25
RET
1.23
ured
1.21
rete
1.20
retion
1.16
Activations Density 0.024%