Explanations
phrases indicating something that has not been done or completed yet
phrases indicating something that has not occurred or been completed yet
Negative Logits
ufact            -0.82
gang             -0.69
similarities     -0.65
ãĥ¼ãĥ            -0.65
chio             -0.62
ãĥ³ãĤ¸           -0.60
disproportion    -0.60
ging             -0.60
packs            -0.59
gers             -0.59
Positive Logits
terday           0.78
?:               0.72
hin              0.71
;)               0.70
anyways          0.70
!                0.70
here             0.69
anyway           0.69
NESS             0.69
!!               0.68
Activations Density 0.028%
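
The density figure above is easiest to read as a fraction of token positions. The sketch below assumes that reading: density is the share of sampled token positions on which the feature's activation is above zero. The page itself does not state the definition, so the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all sampled positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 2.7]
density = activation_density(sample)

# Format as a percentage with three decimals, the same shape as the figure above.
print(f"{density:.3%}")
```

Under this reading, a density of 0.028% would mean the feature fires on roughly 28 of every 100,000 sampled tokens.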