INDEX
Explanations
proper nouns referring to a specific person or place
mentions of the name "Young."
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.85
++++++++++++++++
-0.82
ãĥķãĤ©
-0.80
raints
-0.79
DCS
-0.75
sink
-0.74
displayText
-0.71
代
-0.69
ãĥ¤
-0.69
ossession
-0.67
POSITIVE LOGITS
lings
1.12
blood
1.01
Young
0.96
Young
0.90
er
0.84
sta
0.84
ness
0.83
strom
0.81
erer
0.81
leigh
0.80
Activations Density 0.010%