INDEX
Explanations
references to questions or conversations that seek opinions or information
phrases that indicate asking or referencing questions and discussions
Negative Logits
soever        -0.79
anooga        -0.77
iren          -0.73
Frameworks    -0.71
à¼            -0.68
Ò             -0.67
ulum          -0.67
Ê             -0.67
seed          -0.67
%%            -0.66
Positive Logits
why           1.18
whether       1.06
how           1.04
his           1.03
wanting       0.93
possible      0.91
himself       0.84
rumors        0.84
allegations   0.84
rumours       0.80
Activations Density 0.268%