INDEX
Explanations
mentions of the name "Lam" followed by a mix of numbers or letters
mentions of the name "Lam" and variations of the term
New Auto-Interp
Negative Logits
Subtle
-0.87
ãģį
-0.86
Occupations
-0.71
REDACTED
-0.65
IBLE
-0.65
Marketable
-0.65
Decay
-0.65
ãĥĥãĥĪ
-0.65
Fiction
-0.64
gradient
-0.63
POSITIVE LOGITS
borgh
1.40
amo
0.96
ont
0.95
inated
0.93
oufl
0.90
ouse
0.90
onte
0.89
uci
0.89
minist
0.89
azing
0.88
Activations Density 0.011%