INDEX
Explanations
names and locations
references to locations, events, and activities
New Auto-Interp
Negative Logits
ĨĴ
-0.84
ħĭ
-0.72
elta
-0.69
umi
-0.64
ctr
-0.61
é¾įå
-0.61
cffff
-0.60
ij士
-0.59
ãĤ®
-0.58
ibaba
-0.56
POSITIVE LOGITS
on
1.56
ON
1.41
on
1.38
On
1.32
On
1.31
ons
1.12
onto
1.10
ON
1.03
onica
0.94
off
0.92
Activations Density 0.206%