INDEX
Explanations
phrases referring to specific objects or concepts
instances of the word "this."
New Auto-Interp
Negative Logits
osponsors
-0.89
emale
-0.81
ographies
-0.81
acers
-0.79
å§«
-0.78
ARDS
-0.77
aneers
-0.76
tops
-0.76
rights
-0.75
tones
-0.75
POSITIVE LOGITS
nifty
1.08
adorable
1.06
delightful
1.05
lovely
1.04
amazing
0.99
gorgeous
0.98
hilarious
0.97
incredible
0.97
enigmatic
0.94
wonderful
0.93
Activations Density 0.133%