INDEX
Explanations
mentions of wholeness or completeness
references to the word "entire" in various contexts
New Auto-Interp
Negative Logits
friends
-0.75
none
-0.71
chairs
-0.71
killers
-0.70
shows
-0.68
acons
-0.66
rants
-0.65
ju
-0.65
KER
-0.64
whispers
-0.63
POSITIVE LOGITS
length
0.88
spectrum
0.86
continent
0.86
ordeal
0.86
hearted
0.84
globe
0.83
gam
0.82
arsenal
0.80
duration
0.79
heartedly
0.76
Activations Density 0.037%