INDEX
Explanations
points or arguments being made in a text
references to key points being made in discussions or arguments
Negative Logits
Token        Logit
DAQ          -0.86
reditary     -0.84
uthor        -0.84
destro       -0.82
notor        -0.80
eatures      -0.78
apons        -0.77
eco          -0.75
undai        -0.75
emale        -0.74
Positive Logits
Token        Logit
lessly       0.93
point        0.89
points       0.88
blank        0.82
posted       0.82
forward      0.82
lessness     0.77
deduction    0.75
posts        0.75
iasis        0.74
Activations Density: 0.033%