INDEX
Explanations
words related to technological utility and importance
terms related to usefulness and quality
New Auto-Interp
Negative Logits
gemony
-0.62
arlane
-0.62
livion
-0.62
uthor
-0.61
everal
-0.58
roy
-0.57
Gene
-0.56
Whitman
-0.55
Roy
-0.55
unia
-0.55
POSITIVE LOGITS
if
1.26
because
1.12
when
1.12
unless
1.01
since
1.01
considering
0.98
depending
0.97
for
0.96
whenever
0.91
when
0.88
Activations Density 0.219%