Explanations
human, artificial, love, development, population
Negative Logits
only        0.72
*           0.69
(           0.68
back        0.68
receiving   0.66
very        0.66
sometimes   0.64
to          0.64
cute        0.64
slightly    0.63
Positive Logits
álně        1.01
Amérique    0.99
aków        0.96
ônico       0.93
Chúng       0.93
ónicos      0.91
ląd         0.89
ênio        0.89
Além        0.88
ujemo       0.88
Activations Density: 0.102%