INDEX
Explanations
phrases or sentences related to detailed statements
occurrences of the word "statement."
New Auto-Interp
Negative Logits
bid
-0.78
rys
-0.77
MpServer
-0.75
itals
-0.72
elsius
-0.72
Friend
-0.72
rowd
-0.71
osponsors
-0.67
cffff
-0.67
versely
-0.67
POSITIVE LOGITS
statements
0.88
ariat
0.83
pronoun
0.81
uttered
0.81
gow
0.81
statement
0.80
ARB
0.78
regarding
0.74
unequivocally
0.73
Statement
0.72
Activations Density 0.030%