INDEX
Explanations
names of people or entities
proper nouns, particularly names and titles
New Auto-Interp
Negative Logits
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.80
respectively
-0.75
seminal
-0.65
[&
-0.65
LOT
-0.63
STEM
-0.63
ĵĺ
-0.62
perform
-0.60
Founders
-0.60
etc
-0.59
POSITIVE LOGITS
resa
1.17
vertising
1.08
withstanding
1.05
roximately
1.04
anie
1.04
jamin
1.01
odore
1.00
hesda
0.99
alyst
0.98
ogether
0.97
Activations Density 0.322%