INDEX
Explanations
statements about being informed or knowledgeable about various topics
instances of the word "aware" in various contexts
New Auto-Interp
Negative Logits
mini
-0.72
cell
-0.68
OPLE
-0.66
Textures
-0.66
nic
-0.65
packages
-0.62
tele
-0.62
entry
-0.62
mer
-0.62
par
-0.62
POSITIVE LOGITS
ledged
1.16
aware
1.04
Aware
0.98
isance
0.90
ness
0.86
ledge
0.84
NOTICE
0.79
icip
0.79
itaire
0.79
Lauder
0.75
Activations Density 0.019%