INDEX
Explanations
words related to immigration
terms related to immigration
New Auto-Interp
Negative Logits
Clyde
-0.74
trough
-0.71
derby
-0.69
Cly
-0.66
PAC
-0.65
Sag
-0.65
Gou
-0.62
Prompt
-0.62
Chevron
-0.61
frequency
-0.61
POSITIVE LOGITS
imm
4.65
immer
1.66
immers
1.60
Imm
1.44
ims
1.29
imm
1.25
im
1.21
Imm
1.21
mut
1.12
imming
1.09
Activations Density 0.019%