INDEX
Explanations
potential actions or decisions related to socioeconomic issues and policies
New Auto-Interp
Negative Logits
osc
-0.63
||||
-0.58
scope
-0.57
Ùĩ
-0.56
neg
-0.56
roads
-0.56
ãģł
-0.55
thro
-0.54
Ùħ
-0.54
ãģ®å®
-0.54
POSITIVE LOGITS
bestos
1.42
piring
1.34
semb
1.32
phalt
1.29
pects
1.27
ylum
1.22
piration
1.18
semble
1.16
king
1.14
ymm
1.11
Activations Density 0.239%