INDEX
Explanations
references to the word "about"
New Auto-Interp
Negative Logits
ModelExpression
-0.91
Efq
-0.73
Diony
-0.69
itſelf
-0.66
againſt
-0.65
himſelf
-0.65
ſy
-0.65
houſe
-0.64
uſe
-0.63
Houſe
-0.63
POSITIVE LOGITS
the
0.79
how
0.72
áról
0.62
GEBURTSDATUM
0.61
them
0.61
halfway
0.60
whom
0.59
ailangan
0.59
why
0.58
ABOUT
0.58
Activations Density 0.093%