Explanations
references to people or individuals in various contexts
references to "people" and their opinions or perceptions
Negative Logits
tnc            -0.85
srfAttach      -0.83
Accessory      -0.80
uner           -0.70
COMPLE         -0.70
NES            -0.68
efully         -0.68
iary           -0.67
UV             -0.66
Material       -0.65
Positive Logits
smugglers      1.06
who            0.99
else           0.86
folk           0.86
perceive       0.84
wanting        0.81
underestimate  0.80
clam           0.79
realise        0.78
cared          0.76
Activations Density: 0.116%