Explanations
instances of the word "Lam" or variations thereof
Negative Logits
Token      Logit
ãģį        -0.76
GROUND     -0.70
Occupations  -0.68
REDACTED   -0.67
Subtle     -0.66
ģĸ         -0.65
ãĥĥãĥĪ     -0.63
IMAGES     -0.61
6666       -0.60
IBLE       -0.59
Positive Logits
Token      Logit
borgh      1.45
ont        1.08
oufl       1.03
pling      1.01
bs         1.01
inated     1.00
ouse       1.00
oran       0.98
onte       0.98
othe       0.97
Activations Density 0.003%
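
Some tokens in the negative-logit list (ãģį, ģĸ, ãĥĥãĥĪ) look garbled but are probably not encoding errors. A minimal sketch, assuming the underlying model uses a GPT-2-style byte-level BPE vocabulary (the page does not say which tokenizer is used): in that scheme every raw byte is displayed as a printable stand-in character, so multi-byte UTF-8 text shows up as strings like these. The helper names `token_to_bytes` and `decode_token` are hypothetical and only for illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2-style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 or above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Recover the raw bytes behind a displayed token string."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


def decode_token(token: str) -> str:
    """Decode a displayed token to text; incomplete UTF-8 becomes U+FFFD."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Assumption: these tokens come from a GPT-2-style vocabulary.
    for tok in ["ãģį", "ģĸ", "ãĥĥãĥĪ", "GROUND"]:
        print(repr(tok), token_to_bytes(tok), repr(decode_token(tok)))
```

Under that assumption, ãģį and ãĥĥãĥĪ decode to Japanese kana, while ģĸ is a fragment of a multi-byte character (two continuation bytes) and has no standalone text form.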