INDEX
Explanations
phrases related to proportion or composition
occurrences of the phrase "make up."
New Auto-Interp
Negative Logits
aughter
-0.71
BILITY
-0.64
ridor
-0.62
perature
-0.61
hiba
-0.61
ait
-0.61
isy
-0.61
gee
-0.60
hens
-0.60
awa
-0.60
POSITIVE LOGITS
ulates
0.83
ulate
0.73
rations
0.64
iframe
0.62
landish
0.60
bands
0.59
esty
0.58
excuses
0.58
ulated
0.58
subreddits
0.57
Activations Density 0.032%