INDEX
Explanations
the word "is" followed by an adjective or noun phrase, often in the context of questioning or criticism
questions that seek clarification or explanation
New Auto-Interp
Negative Logits
ptoms
-0.75
brates
-0.75
eers
-0.74
aneers
-0.73
weights
-0.70
tails
-0.69
credits
-0.68
Sabres
-0.68
IMAGES
-0.67
details
-0.67
POSITIVE LOGITS
olated
1.37
olation
1.34
olate
1.16
htar
1.04
abella
0.96
omorphic
0.91
earch
0.88
peria
0.85
terness
0.84
ometric
0.83
Activations Density 0.130%