INDEX
Explanations
words related to advantages and disadvantages
words that indicate various forms of excess or downsides
New Auto-Interp
Negative Logits
¾
-0.68
floor
-0.64
Effective
-0.64
members
-0.64
YC
-0.63
VID
-0.63
IFE
-0.62
asus
-0.62
RFC
-0.62
LET
-0.61
POSITIVE LOGITS
ettings
0.97
pring
0.97
uits
0.94
abound
0.93
ensical
0.93
hooting
0.92
hift
0.92
eries
0.92
hip
0.89
thereof
0.89
Activations Density 0.113%