INDEX
Explanations
phrases that reference a unit or entity as a whole
references to collective groups or the concept of "whole" in relation to a nation or society
New Auto-Interp
Negative Logits
andon
-0.89
ATURES
-0.69
yang
-0.66
CHAPTER
-0.64
etz
-0.64
uden
-0.64
Joy
-0.63
indal
-0.63
orange
-0.62
abel
-0.62
POSITIVE LOGITS
result
1.47
consequence
1.26
whole
1.24
standalone
0.92
matter
0.92
predictor
0.90
percentage
0.87
cohesive
0.87
multiplier
0.84
deterrent
0.84
Activations Density 0.096%