INDEX
Explanations
mentions of significant personal or relational changes
Negative Logits
ureen    -0.15
ogens    -0.14
estro    -0.14
cts      -0.14
osi      -0.14
inine    -0.14
eeper    -0.13
upe      -0.13
723      -0.13
ilton    -0.13
Positive Logits
until    0.91
until    0.84
Until    0.76
Until    0.73
.until   0.61
hasta    0.60
_until   0.59
till     0.58
até      0.56
jusqu    0.54
Activations Density: 0.223%