INDEX
Explanations
references to prominent individuals and their contributions in various fields
New Auto-Interp
Negative Logits
Leading
-0.17
anny
-0.15
ipers
-0.15
Į¨
-0.14
Leading
-0.14
leading
-0.14
unik
-0.14
versions
-0.14
irected
-0.14
leading
-0.13
POSITIVE LOGITS
best
0.68
best
0.53
-best
0.45
BEST
0.41
Best
0.41
(best
0.41
known
0.41
Best
0.40
_best
0.40
better
0.39
Activations Density 0.088%