INDEX
Explanations
references to specific locations or establishments
New Auto-Interp
Negative Logits
ſche
-0.89
myſelf
-0.88
auffi
-0.87
pleaſure
-0.86
poffible
-0.85
houſe
-0.82
ſtate
-0.80
ſelves
-0.77
cauſe
-0.77
purpoſe
-0.77
POSITIVE LOGITS
apparently
0.93
IIRC
0.89
(?)
0.88
(?)
0.85
actually
0.82
iirc
0.82
pretty
0.81
apparently
0.81
basically
0.81
guy
0.78
Activations Density 0.463%