INDEX
Explanations
words or phrases related to specific entities
references to the letter "B"
New Auto-Interp
Negative Logits
Gutenberg
-0.70
dere
-0.63
Pharaoh
-0.58
rpm
-0.58
punt
-0.58
Lerner
-0.57
acne
-0.57
Ebola
-0.56
wcsstore
-0.55
Dire
-0.55
POSITIVE LOGITS
odies
1.24
asket
1.23
amboo
1.22
antam
1.16
isexual
1.15
razen
1.14
izarre
1.14
ibli
1.14
azaar
1.12
OTH
1.12
Activations Density 0.047%