INDEX
Explanations
references to people or things with the word "Young" in them
occurrences of the word "Young."
New Auto-Interp
Negative Logits
++++++++++++++++
-0.87
ossession
-0.76
DCS
-0.75
acle
-0.75
ãĤ´ãĥ³
-0.75
arium
-0.74
oute
-0.73
igslist
-0.72
shotgun
-0.70
illin
-0.70
POSITIVE LOGITS
lings
1.16
blood
1.09
stown
0.92
ness
0.91
ster
0.85
ening
0.85
Tang
0.84
zin
0.83
sters
0.83
paren
0.81
Activations Density 0.019%