INDEX
Explanations
references to abstract concepts and existential inquiries
"...thing" or "...thing."
Negative Logits
Token           Logit
none            -0.48
none            -0.46
None            -0.46
rans            -0.43
ритори          -0.43
endwhile        -0.43
vyn             -0.42
ともに          -0.41
CreateModel     -0.41
TemporalType    -0.40
Positive Logits
Token           Logit
thing           3.78
THING           2.85
thing           2.79
things          2.66
Thing           2.66
Thing           2.50
Things          2.25
things          2.20
THINGS          2.19
Things          2.19
Activations Density: 0.262%
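
A density figure like this can be read as the share of sampled token positions on which the feature fires. The sketch below is only an illustration under that assumption: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which
    the feature is active; the dashboard does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 262 of 100,000 token positions.
sample = [1.5] * 262 + [0.0] * 99_738
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.262%
```

Under this reading, a density of 0.262% would mean the feature is active on roughly 1 in 380 tokens, which fits a feature tied to a single word family.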