INDEX
Explanations
instances of the word "very"
extreme adjectives, particularly the word "very."
Negative Logits
Token       Logit
lees        -0.96
Releases    -0.78
aneers      -0.76
ses         -0.73
trophies    -0.72
ults        -0.72
purchases   -0.72
launchers   -0.71
agents      -0.70
ULTS        -0.70
Positive Logits
Token            Logit
similar          0.78
technical        0.77
wide             0.77
representative   0.75
narrow           0.75
primitive        0.74
interesting      0.73
unique           0.73
characteristic   0.71
mixture          0.71
Activations Density: 0.086%
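
As a rough illustration of what a density figure like this could mean, the sketch below computes the fraction of token positions on which a feature is active. It assumes density is defined as "share of tokens with activation above a threshold"; the page does not state its exact definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature
    fires at all. The dashboard's actual definition is not given.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up data: 100,000 token positions, 86 of them with a nonzero activation.
sample = [0.0] * 99_914 + [1.5] * 86
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumed definition, a density of 0.086% would correspond to the feature firing on roughly 86 out of every 100,000 tokens.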