Explanations
references to specific individuals and concepts related to economics and media
Negative Logits
avra      -0.17
oose      -0.16
etur      -0.16
ileo      -0.15
585       -0.15
kend      -0.14
planta    -0.14
Scoped    -0.14
yn        -0.14
ãģĺ       -0.14
Positive Logits
-UA       0.17
uni       0.15
LLU       0.15
ùa        0.14
TAS       0.14
_Impl     0.13
BASIS     0.13
/epl      0.13
izm       0.13
-parse    0.13
Activations Density: 0.004%