INDEX
Explanations
instances of the word "weird" and related variations
occurrences and variations of the word "weird."
New Auto-Interp
Negative Logits
vation
-0.89
cussion
-0.80
ptives
-0.77
payers
-0.75
apers
-0.74
agles
-0.74
inders
-0.74
adr
-0.72
glas
-0.72
ailable
-0.71
POSITIVE LOGITS
ness
1.08
ly
1.07
nesses
0.94
ities
0.93
est
0.87
occurrences
0.81
twist
0.79
ishly
0.78
balls
0.77
entimes
0.76
Activations Density 0.051%