INDEX
Explanations
words related to obstruction or hindrance
gerunds and present participles that signify ongoing actions or states
New Auto-Interp
Negative Logits
=-=-=-=-=-=-=-=-
-0.80
VIEW
-0.68
akespe
-0.66
abol
-0.65
uggage
-0.63
bara
-0.63
rophe
-0.63
=-=-=-=-
-0.62
ilts
-0.61
REP
-0.61
POSITIVE LOGITS
eding
1.33
eded
1.10
icators
0.86
ede
0.84
ication
0.84
icator
0.82
irect
0.81
Kepler
0.79
ktop
0.77
edIn
0.76
Activations Density 0.009%