INDEX
Explanations
references to organizational names or groups
New Auto-Interp
Negative Logits
those
-0.69
########.
-0.62
KommentareTeilen
-0.62
quello
-0.62
those
-0.61
theirs
-0.58
their
-0.58
respectively
-0.57
the
-0.57
的那
-0.56
POSITIVE LOGITS
aim
1.17
goal
1.09
purpose
1.01
objective
0.99
目的是
0.94
total
0.89
following
0.84
objetivo
0.83
bedo
0.82
highlight
0.82
Activations Density 0.550%