INDEX
Explanations
phrases related to visibility or perception
Negative Logits
bezeichneter      -0.82
loisirs           -0.79
@"/               -0.78
universitarios    -0.74
knapp             -0.73
lunghi            -0.72
Poehler           -0.72
Hofmann           -0.71
unately           -0.69
Grath             -0.69
Positive Logits
Sees    1.70
sees    1.67
see     1.56
saw     1.54
seen    1.52
SEE     1.48
SEEN    1.45
See     1.44
Saw     1.43
Seen    1.42
Activations Density 0.151%
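
The density figure can be read as the fraction of sampled tokens on which the feature is active. The sketch below illustrates that reading only. The function name, the threshold parameter, and the sample data are hypothetical, and the page does not state how its 0.151% was actually computed.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "active" means strictly greater than the threshold
    (0.0 by default). The source page does not define this.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


if __name__ == "__main__":
    # Made-up sample: 2000 tokens, 3 of which activate the feature.
    sample = [0.0] * 1997 + [0.4, 1.1, 2.3]
    print(f"{activation_density(sample):.3%}")
```

A density this low would mean the feature fires on only about one or two tokens per thousand, which fits a feature tied to a narrow set of word forms such as those in the positive logits list.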