INDEX
Explanations
descriptive language of scenes with people feeling tension
looking/staring
New Auto-Interp
Negative Logits
Espèce
-0.55
ifan
-0.54
'\\;'
-0.53
ویکیپدی
-0.52
rapides
-0.52
nonUne
-0.52
ंदीखरीदारी
-0.49
kokona
-0.49
pauvres
-0.47
gynhyrchwyd
-0.46
POSITIVE LOGITS
enumi
0.63
GoogleFonts
0.57
Wide
0.57
ap
0.56
neutron
0.56
Fixes
0.54
Weiterlesen
0.54
open
0.53
eaway
0.51
สบ
0.51
Activations Density 1.098%