INDEX
Explanations
references to various forms of media and informational resources
New Auto-Interp
Negative Logits
civilian
-0.18
islav
-0.17
/authentication
-0.16
civilians
-0.16
Gang
-0.16
issan
-0.15
æĸĩ
-0.15
/trunk
-0.15
rud
-0.14
GANG
-0.14
POSITIVE LOGITS
spoken
0.18
video
0.16
radio
0.16
ÑĥÑģÑĤ
0.16
oral
0.16
630
0.15
ushman
0.15
_hook
0.15
spoken
0.15
orsch
0.15
Activations Density 0.171%