INDEX
Explanations
personalized messages indicating sharing or providing something to someone
references to the reader or audience
New Auto-Interp
Negative Logits
ipal
-0.93
isa
-0.71
ended
-0.70
land
-0.66
peed
-0.65
ĸļ
-0.65
mare
-0.65
ela
-0.63
oldown
-0.63
aired
-0.63
POSITIVE LOGITS
guys
1.76
gentlemen
1.32
ladies
1.16
yourselves
1.14
folks
1.13
tub
1.13
boys
1.08
dudes
1.08
idiots
0.99
readers
0.95
Activations Density 0.139%