INDEX
Explanations
phrases related to analysis and evaluation
statements indicating the nature or status of various subjects
New Auto-Interp
Negative Logits
stood
-0.75
said
-0.69
Said
-0.67
mentioned
-0.66
icago
-0.66
RANT
-0.63
onnaissance
-0.63
imony
-0.61
idth
-0.61
congr
-0.60
POSITIVE LOGITS
indeed
1.66
NOT
1.17
not
1.15
considerably
1.08
definitely
1.08
actually
1.06
neither
1.06
substantially
1.06
unlikely
1.05
vastly
1.03
Activations Density 0.400%