INDEX
Explanations
terms related to motivation and motivational themes
Negative Logits
ialog     -0.18
hole      -0.15
ftime     -0.15
archy     -0.15
endale    -0.14
icken     -0.14
richt     -0.14
holes     -0.14
oy        -0.14
ister     -0.14
Positive Logits
ivated    0.29
ivation   0.27
amedi     0.24
oring     0.23
ivating   0.21
swana     0.20
mot       0.20
gomery    0.20
tram      0.19
tingham   0.19
Activations Density 0.008%