INDEX
Explanations
proper nouns and specific details such as names, dates, and locations
references to specific events, locations, or significant terms associated with activities or happenings
Negative Logits
Token    Logit
ĨĴ       -0.80
elta     -0.66
jugg     -0.62
Ãį       -0.61
ibal     -0.61
adi      -0.61
idel     -0.60
ÃŃ       -0.59
Imm      -0.58
iber     -0.58
Positive Logits
Token    Logit
on       1.25
ON       1.21
on       1.19
ons      1.07
On       1.05
On       1.02
ON       0.99
ONS      0.93
onian    0.93
onto     0.93
Activations Density: 0.144%