INDEX
Explanations
well-written statements or text
instances of the term "written."
New Auto-Interp
Negative Logits
camp
-0.68
inas
-0.65
camps
-0.65
lag
-0.63
tech
-0.62
subs
-0.62
heights
-0.62
bot
-0.62
hub
-0.61
isolate
-0.61
POSITIVE LOGITS
written
4.06
Written
2.64
written
2.02
write
1.99
Written
1.89
wrote
1.89
writing
1.87
writ
1.76
authored
1.64
ritten
1.59
Activations Density 0.015%