Explanations
personal experiences, thoughts, or emotional reactions
expressions of personal feelings and experiences
Negative Logits
Token      Logit
Equality   -0.77
tide       -0.64
Pric       -0.64
Towns      -0.60
roup       -0.60
Centers    -0.60
profits    -0.59
tides      -0.59
itect      -0.58
edged      -0.58
Positive Logits
Token        Logit
zzo          1.13
personally   1.05
adows        1.03
adow         1.03
imei         1.00
lees         0.97
andering     0.91
cca          0.87
zz           0.77
'm           0.76
Activations Density: 0.096%