INDEX
Explanations
numerical values or measurements in a text
New Auto-Interp
Negative Logits
condem
-0.65
disposed
-0.65
redesign
-0.64
dayName
-0.63
dstg
-0.62
agre
-0.60
performance
-0.56
lapt
-0.56
ertodd
-0.54
leep
-0.53
POSITIVE LOGITS
those
0.83
us
0.80
these
0.79
them
0.79
those
0.79
course
0.76
them
0.76
course
0.73
our
0.72
these
0.71
Activations Density 2.841%