INDEX
Explanations
phrases expressing personal opinions
New Auto-Interp
Negative Logits
Gothic
-0.77
aic
-0.69
consequential
-0.66
Window
-0.65
Conversation
-0.63
Meaning
-0.63
Azerbai
-0.63
Organizations
-0.61
Palest
-0.59
Warden
-0.59
POSITIVE LOGITS
took
1.23
knew
1.17
gave
1.17
drove
1.15
went
1.15
underwent
1.14
blew
1.13
stole
1.11
became
1.11
chose
1.10
Activations Density 1.036%