INDEX
Explanations
references to location-specific nouns and their associated articles
Article in Spanish, French, or Italian
Los / les / negli / anciens + noun
New Auto-Interp
Negative Logits
Majefty
-0.93
fubject
-0.93
caufe
-0.93
purpoſe
-0.92
poffible
-0.92
itſelf
-0.91
reaſon
-0.88
ftate
-0.88
occafion
-0.88
ſtate
-0.87
POSITIVE LOGITS
efforts
0.46
inputs
0.42
bonitos
0.42
bons
0.39
insights
0.38
possibles
0.37
seus
0.37
ושים
0.36
кет
0.35
ctivos
0.35
Activations Density 0.028%