Explanations
phrases related to balance and equality in terms of experience or opportunity
Negative Logits
enzie    -0.14
tility    -0.14
ÙĬتÙĬ    -0.14
akit    -0.14
(no token shown)    -0.14
aload    -0.14
ackle    -0.13
лен    -0.13
436    -0.13
993    -0.13
Positive Logits
umer    0.19
oton    0.17
urg    0.15
yntax    0.15
_classifier    0.14
oplast    0.14
urga    0.14
jekt    0.14
iger    0.13
owi    0.13
Activations Density 0.243%