INDEX
Explanations
references to authority figures or groups that hold significant influence or control
New Auto-Interp
Negative Logits
lenker
-0.62
EndGlobalSection
-0.49
Personendaten
-0.46
Slf
-0.44
kasarigan
-0.43
насељу
-0.41
الإنجليزية
-0.41
ProtoMessage
-0.41
čiau
-0.41
commento
-0.40
POSITIVE LOGITS
humble
0.49
semplici
0.47
simple
0.47
simple
0.47
little
0.46
dusty
0.46
poignée
0.45
little
0.44
tiny
0.43
dusty
0.43
Activations Density 0.408%