INDEX
Explanations
central themes or primary ideas related to various topics
New Auto-Interp
Negative Logits
main
-0.19
Main
-0.19
major
-0.19
Main
-0.16
main
-0.15
little
-0.15
urette
-0.15
Firm
-0.15
Major
-0.14
iktig
-0.14
POSITIVE LOGITS
stay
0.28
driver
0.22
focus
0.22
thrust
0.22
drivers
0.20
focuses
0.19
concern
0.19
driving
0.19
reason
0.19
failing
0.19
Activations Density 0.101%