INDEX
Explanations
discussions related to various types of bans and regulations
New Auto-Interp
Negative Logits
ãĥªãĥ¼ãĤº
-0.17
æľĢæĸ°
-0.15
ä¸įæĸŃ
-0.15
apus
-0.15
ç»ĩ
-0.14
æ³Ĭ
-0.14
statutory
-0.14
/latest
-0.14
recent
-0.13
ç²¾åĵģ
-0.13
POSITIVE LOGITS
certain
0.44
Certain
0.38
Certain
0.36
certains
0.29
bestimm
0.26
anyone
0.24
æŁIJ
0.21
ertain
0.21
ish
0.20
anybody
0.20
Activations Density 0.331%