INDEX
Explanations
words related to classification or categories
instances of the word "class" and its variations in different contexts
Negative Logits
Token       Logit
hiba        -0.68
aukee       -0.64
OPLE        -0.62
obin        -0.62
foreseen    -0.61
stump       -0.61
vernment    -0.61
orthy       -0.61
BAT         -0.58
vind        -0.57
Positive Logits
Token       Logit
ifications  1.40
ifier       1.31
ifiers      1.29
ifying      1.26
ifies       1.21
ifiable     1.16
ified       1.12
ification   1.06
ically      1.06
room        1.05
Activations Density 0.042%