INDEX
Explanations
proper nouns, possibly related to locations or names
mentions of the name "Gall."
Negative Logits
BOOK        -0.76
CLASSIFIED  -0.68
ccording    -0.64
terday      -0.62
Helpful     -0.60
reality     -0.59
ding        -0.59
PRES        -0.58
anyahu      -0.57
huh         -0.57
Positive Logits
agher       1.08
antry       1.06
oway        1.03
eries       1.00
oise        0.98
ium         0.97
atin        0.94
ois         0.91
esian       0.90
oping       0.89
Activations Density: 0.022%