INDEX
Explanations
phrases related to decision-making and relationships
New Auto-Interp
Negative Logits
aze
-0.15
treff
-0.14
azes
-0.14
petite
-0.14
vez
-0.14
uze
-0.14
imes
-0.14
');?>"
-0.14
plusplus
-0.13
dimin
-0.13
POSITIVE LOGITS
major
0.62
big
0.54
large
0.50
huge
0.48
major
0.43
significant
0.42
massive
0.42
large
0.40
big
0.40
éĩį大
0.40
Activations Density 0.050%