INDEX
Explanations
references to expertise or skill in a particular subject
New Auto-Interp
Negative Logits
OPLE
-0.75
IDER
-0.69
hedon
-0.65
enegger
-0.65
Torn
-0.64
IGH
-0.64
vernment
-0.63
AAF
-0.62
tics
-0.60
Aren
-0.60
POSITIVE LOGITS
pieces
1.47
piece
1.29
mind
1.18
stroke
1.05
classes
0.98
class
0.98
fully
0.91
sonian
0.91
work
0.89
minded
0.88
Activations Density 0.074%