INDEX
Explanations
categories or classification terms
references to categories or classifications
Negative Logits
zos          -0.68
OUGH         -0.66
arbon        -0.65
ONEY         -0.65
anoia        -0.64
rets         -0.63
arie         -0.62
anew         -0.62
reconnect    -0.61
ourn         -0.60
Positive Logits
category          3.68
categories        2.62
category          2.39
Category          2.39
Category          2.02
Categories        1.84
classification    1.67
genre             1.45
categor           1.40
ategory           1.36
Activations Density 0.012%