INDEX
Explanations
references to perceptions and subjective experiences
Negative Logits
weds        -0.73
Alvarado    -0.70
rawler      -0.69
smtplib     -0.68
könyv       -0.66
McGuire     -0.66
Kidman      -0.66
Zeneca      -0.65
pestaña     -0.64
Optim       -0.64
Positive Logits
sense       2.32
SENSE       2.30
Sense       2.24
sense       2.12
Sense       2.00
senses      1.82
senso       1.49
Senses      1.49
sensed      1.45
sentido     1.37
Activations Density 0.085%