INDEX
Explanations
verbs related to research and investigation
reporting research findings
Negative Logits
ſch         -0.93
ſelf        -0.88
myſelf      -0.86
ſta         -0.82
ſelves      -0.79
ſte         -0.77
Beſ         -0.76
fjspx       -0.75
Perſ        -0.75
ſche        -0.74
Positive Logits
and         0.42
by          0.40
in          0.37
while       0.36
.           0.35
vlastnosti  0.34
relative    0.34
via         0.34
clairement  0.33
as          0.32
Activations Density: 0.098%