INDEX
Negative Logits
Bildung
-0.09
_TILE
-0.08
_credentials
-0.08
Christianity
-0.08
stealing
-0.08
Networks
-0.08
nin
-0.08
age
-0.08
bike
-0.07
/accounts
-0.07
POSITIVE LOGITS
cues
0.11
annotations
0.10
cue
0.09
dramatur
0.09
annotations
0.09
নির্দেশ
0.09
instrucciones
0.09
annotation
0.09
prescriptions
0.09
Annotations
0.09
Activations Density 0.023%