INDEX
Explanations
proper nouns related to people or places
references to the name "Bet" or variations of it in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.81
eclipse
-0.78
VILLE
-0.77
obser
-0.70
osuke
-0.69
conservancy
-0.68
andr
-0.67
issance
-0.66
OPLE
-0.65
Babel
-0.64
POSITIVE LOGITS
hesda
1.33
ting
1.30
ray
1.18
tered
0.96
lehem
0.94
tern
0.94
terness
0.93
ters
0.93
tery
0.92
rix
0.92
Activations Density 0.033%