INDEX
Explanations
references to specific individuals in a context involving police or government
New Auto-Interp
Negative Logits
next
-0.50
usz
-0.47
руйте
-0.40
ят
-0.40
Next
-0.39
domestic
-0.39
gener
-0.38
PhysRevLett
-0.38
aronder
-0.38
いかがでしたか
-0.38
POSITIVE LOGITS
EconPapers
1.00
فريبيس
0.98
SharedDtor
0.89
脚注の使い方
0.88
Portály
0.81
Geplaatst
0.80
0.77
مرئيه
0.76
Chwiliwch
0.76
:✨
0.75
Activations Density 0.197%