INDEX
Explanations
mentions of things being visually perceived or aesthetically described
references to visual appearances or aesthetics
New Auto-Interp
Negative Logits
Literature
-0.74
cffff
-0.73
wine
-0.68
erto
-0.64
Kurd
-0.64
Tosh
-0.63
scribed
-0.62
acus
-0.61
icipated
-0.59
Loch
-0.59
POSITIVE LOGITS
ahead
1.02
checks
0.79
finder
0.77
ENE
0.73
ups
0.73
tones
0.68
bones
0.66
etic
0.66
looks
0.66
outs
0.66
Activations Density 0.037%