INDEX
Explanations
the name "Berg" or variations thereof, indicating a focus on a specific individual or entity
New Auto-Interp
Negative Logits
er
-0.19
rsa
-0.19
rig
-0.18
orea
-0.16
761
-0.15
curity
-0.15
pieces
-0.15
vise
-0.15
rs
-0.15
Trailer
-0.15
POSITIVE LOGITS
antino
0.21
amas
0.20
amo
0.20
undy
0.18
thora
0.18
doll
0.17
onian
0.17
emann
0.17
dorf
0.17
heimer
0.16
Activations Density 0.007%