INDEX
Explanations
discussions about taxation and its consequences
New Auto-Interp
Negative Logits
inal
-0.17
alach
-0.15
itis
-0.15
ych
-0.14
alone
-0.14
Amit
-0.14
aris
-0.14
رÙĬد
-0.13
anon
-0.13
alone
-0.13
POSITIVE LOGITS
actually
0.34
actually
0.32
Actually
0.27
Actually
0.27
instead
0.26
instead
0.25
rather
0.24
éĢĨ
0.24
worse
0.23
opposite
0.23
Activations Density 0.366%