INDEX
Explanations
phrases that express a perception, feeling, or concept
New Auto-Interp
Negative Logits
kään
-0.70
Zeneca
-0.69
Walters
-0.67
Kidman
-0.65
qas
-0.64
Larsson
-0.63
Schulze
-0.63
Cáceres
-0.63
icii
-0.62
Erreferentziak
-0.62
POSITIVE LOGITS
Notion
1.15
SENSE
1.12
sense
0.99
Sense
0.99
sense
0.90
Sense
0.82
notions
0.74
ญิง
0.73
Sot
0.73
Demografie
0.70
Activations Density 0.007%