INDEX
Explanations
expressions of empathy or sympathy for others
expressions of sympathy or concern for others
New Auto-Interp
Negative Logits
oller
-0.89
hess
-0.81
orbit
-0.72
itle
-0.72
soDeliveryDate
-0.71
edin
-0.70
neapolis
-0.69
nil
-0.69
anchester
-0.69
chenko
-0.68
POSITIVE LOGITS
gotten
1.13
bidden
1.10
example
1.05
awhile
0.95
geries
0.93
sake
0.92
centuries
0.91
instance
0.90
millennia
0.88
eternity
0.86
Activations Density 0.196%