INDEX
Explanations
questions about the meaning or significance of something
phrases questioning the meaning of a concept or situation
Negative Logits
ttes         -0.74
tions        -0.68
Pend         -0.67
visors       -0.66
visor        -0.66
diversion    -0.64
Du           -0.62
Paso         -0.62
tk           -0.62
Toy          -0.61
Positive Logits
rawdownloadcloneembedreportprint    0.77
ELL                                 0.71
INGS                                0.68
terday                              0.66
pling                               0.66
goodbye                             0.65
depends                             0.64
agall                               0.64
л                                   0.64
?:                                  0.63
Activations Density: 0.019%