INDEX
Explanations
Japanese names
names of individuals associated with sports, likely athletes
Negative Logits
Reviewer     -0.56
bulletin     -0.49
Picture      -0.47
glim         -0.46
NETWORK      -0.45
FANT         -0.45
positives    -0.45
20439        -0.45
wonderful    -0.44
anonymity    -0.44
Positive Logits
respectively    0.85
etc             0.67
TBA             0.64
.).             0.64
]."             0.61
ĪĴ              0.60
).[             0.58
)).             0.57
};              0.57
))))            0.57
Activations Density 2.001%