INDEX
Explanations
words related to taxation and divisive or separating elements
language related to divisive issues or taxes
New Auto-Interp
Negative Logits
ession
-0.83
kins
-0.83
ainer
-0.77
esses
-0.75
ystem
-0.75
hips
-0.74
grounds
-0.72
icht
-0.71
Chaser
-0.70
kin
-0.70
POSITIVE LOGITS
Boo
0.78
ppo
0.76
wedge
0.74
tein
0.71
ppel
0.70
hect
0.70
76561
0.70
wealth
0.68
ãĥĥãĤ¯
0.68
ĸļ
0.67
Activations Density 0.044%