INDEX
Explanations
arguments or discussions related to various topics, including scientific research, health policy, mathematical competence, and government actions
Negative Logits
backer      -0.60
Guard       -0.60
Poké        -0.56
aukee       -0.56
EMBER       -0.54
enge        -0.53
Baltimore   -0.53
ega         -0.52
Plus        -0.52
guard       -0.52
Positive Logits
"[          0.60
they        0.59
someday     0.58
although    0.57
cher        0.56
there       0.56
"…          0.54
whoever     0.53
prevailed   0.53
justifies   0.53
Activations Density 12.578%