INDEX
Explanations
words related to doors or the interior of buildings
uncommon or non-English words
Negative Logits
thread    -0.85
Thread    -0.76
threads   -0.75
Thread    -0.75
thread    -0.74
threads   -0.59
THREAD    -0.56
Threads   -0.54
线程      -0.54
THREAD    -0.52
Positive Logits
uxxxx           0.76
المناصب         0.71
enterOuterAlt   0.69
stiefel         0.68
Descriptors     0.66
Frames          0.65
expandindo      0.64
rovnik          0.64
configureStore  0.63
цездатний       0.63
Activations Density 1.139%