INDEX
Explanations
names of people, particularly in relation to their relationships and social dynamics
Negative Logits
smr          -0.16
Coastal      -0.16
asics        -0.15
egin         -0.15
_lite        -0.14
UpperCase    -0.14
okie         -0.14
elts         -0.13
hereby       -0.13
.intent      -0.13
Positive Logits
caption      0.19
dating       0.19
datings      0.17
isay         0.16
statuses     0.16
Kardashian   0.16
captions     0.16
romant       0.16
Caption      0.16
dated        0.15
Activations Density: 0.112%