INDEX
Explanations
concepts related to democracy and governance
New Auto-Interp
Negative Logits
OGND
-0.68
ⓧ
-0.63
bard
-0.61
Personendaten
-0.55
atri
-0.53
cés
-0.52
outs
-0.52
urb
-0.52
roz
-0.52
neer
-0.52
POSITIVE LOGITS
varandra
0.67
abstrait
0.64
hâte
0.59
featureID
0.57
toyage
0.55
fallu
0.52
boucles
0.52
EconPapers
0.52
ValueStyle
0.52
xymatrix
0.52
Activations Density 0.458%