INDEX
Explanations
phrases related to beliefs, opinions, and support for different causes or candidates
phrases related to belief and support for individuals or causes
New Auto-Interp
Negative Logits
DragonMagazine
-0.69
usp
-0.66
imgur
-0.65
tnc
-0.60
spection
-0.59
Studio
-0.59
escription
-0.58
irie
-0.58
arthed
-0.55
nesota
-0.55
POSITIVE LOGITS
whereas
1.14
regardless
1.10
but
1.01
irrespective
0.99
insofar
0.99
because
0.93
albeit
0.87
despite
0.86
yes
0.86
therefore
0.85
Activations Density 0.331%