INDEX
Explanations
phrases that describe types or categories of things
New Auto-Interp
Negative Logits
isoto
-0.66
chofe
-0.62
pectin
-0.62
เท่า
-0.61
parry
-0.61
Ogle
-0.60
Torrey
-0.60
περι
-0.59
mascarpone
-0.59
collet
-0.59
POSITIVE LOGITS
WebElementEntity
0.73
Something
0.70
valami
0.68
########.
0.66
utafitiHapana
0.66
متعلقه
0.65
<eos>
0.65
semacam
0.65
something
0.61
Somehow
0.61
Activations Density 0.157%