INDEX
Explanations
queries and questions directed towards others
questions and statements related to academic or intellectual discussions
Negative Logits
xtap          -0.81
ゴン          -0.75
theless       -0.71
nonetheless   -0.70
etheless      -0.69
accordingly   -0.67
pertinent     -0.67
ossibility    -0.67
additionally  -0.66
unlikely      -0.66
Positive Logits
..."   1.69
…"     1.66
..."   1.49
!'"    1.47
?"     1.44
…"     1.43
?'"    1.41
!"     1.40
?!"    1.40
?'     1.38
Activations Density 1.127%