INDEX
Explanations
references to political commentary and sections within a document
Negative Logits
Token      Logit
Ga         -0.15
estone     -0.15
彦         -0.15
fte        -0.15
presence   -0.14
URA        -0.14
ครบ        -0.14
را         -0.14
geil       -0.14
Floors     -0.13
Positive Logits
Token      Logit
section    0.20
category   0.17
Als        0.16
eph        0.16
mond       0.16
Sizer      0.16
Mand       0.15
nde        0.15
Pere       0.15
zone       0.14
Activations Density 0.253%
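
As a rough illustration of how a density figure like the one above can be read, the sketch below treats activation density as the fraction of token positions on which the feature fires. This definition is an assumption, since the dashboard does not state how it computes the value. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature
    is active. The dashboard's exact definition is not given in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 2,000 token positions, 5 of which activate the feature.
sample = [0.0] * 1995 + [0.7, 1.2, 0.3, 2.1, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.253% would mean the feature is active on roughly 1 in 400 token positions.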