INDEX
Explanations
phrases describing a complete or comprehensive situation or concept
references to the concept of "whole," indicating a focus on completeness or entirety in various contexts
Negative Logits
KER          -0.72
osponsors    -0.66
ics          -0.64
ANS          -0.64
uers         -0.64
none         -0.63
acers        -0.63
cs           -0.63
Dr           -0.62
friends      -0.61
Positive Logits
heartedly    1.34
hearted      1.23
thing        1.12
ordeal       1.05
gam          0.93
darn         0.93
damn         0.92
affair       0.87
globe        0.84
idea         0.80
Activations Density: 0.031%