INDEX
Explanations
qualitative descriptions of experiences or events
New Auto-Interp
Negative Logits
.sav
-0.15
legg
-0.15
osaur
-0.15
avern
-0.14
mons
-0.14
اسب
-0.14
AMP
-0.13
itus
-0.13
inviting
-0.13
Hund
-0.13
POSITIVE LOGITS
eye
0.32
eye
0.30
experience
0.29
Experience
0.25
experience
0.25
Eye
0.25
-eye
0.24
Eye
0.24
Experience
0.23
surreal
0.23
Activations Density 0.062%