INDEX
Explanations
opening quotes followed by statements
New Auto-Interp
Negative Logits
Additionally
0.37
asimismo
0.37
{\0.37
additionally
0.36
Additionally
0.35
Asimismo
0.34
également
0.34
今回の
0.34
također
0.33
또한
0.33
POSITIVE LOGITS
люди
0.44
يقول
0.43
дуже
0.43
иногда
0.42
ridiculous
0.42
لوگوں
0.41
มัน
0.40
awful
0.40
logika
0.40
लोग
0.39
Activations Density 0.003%