INDEX
Explanations
assertive statements or beliefs
phrases expressing personal opinions or suggestions
New Auto-Interp
Negative Logits
Done
-0.65
Submission
-0.60
Split
-0.59
Bo
-0.54
Pokemon
-0.54
':
-0.54
opter
-0.54
goo
-0.54
Corona
-0.54
\",
-0.54
POSITIVE LOGITS
therefore
1.40
however
1.07
moreover
0.98
thus
0.90
furthermore
0.85
accordingly
0.84
meanwhile
0.80
anwhile
0.80
consequently
0.79
likewise
0.76
Activations Density 0.974%