INDEX
Explanations
references to tribals or tribal communities
New Auto-Interp
Negative Logits
yi
-0.17
intptr
-0.15
oven
-0.14
venir
-0.14
agna
-0.14
aved
-0.14
ointed
-0.14
seed
-0.13
avity
-0.13
ilde
-0.13
POSITIVE LOGITS
trib
0.27
utes
0.27
ute
0.23
Trib
0.23
eca
0.21
bons
0.20
ally
0.20
onacci
0.20
unal
0.19
bles
0.19
Activations Density 0.006%