INDEX
Explanations
Japanese names emphasized through italicization
names of individuals, particularly those associated with film or media
New Auto-Interp
Negative Logits
ccording
-0.74
necessities
-0.62
differed
-0.61
entails
-0.60
unfavorable
-0.60
suppose
-0.59
Reviewer
-0.59
Asia
-0.59
PASS
-0.58
actionDate
-0.58
POSITIVE LOGITS
Jr
1.33
hetti
1.00
III
0.95
JR
0.93
kson
0.90
Sr
0.90
hoff
0.89
elli
0.88
otti
0.87
meier
0.87
Activations Density 0.209%