INDEX
Explanations
words related to knowledge, awareness, and understanding
New Auto-Interp
Negative Logits
ItemTracker
-0.76
phrine
-0.74
otion
-0.72
isco
-0.72
acco
-0.69
onding
-0.69
erva
-0.69
berus
-0.68
avored
-0.68
thren
-0.67
POSITIVE LOGITS
ledge
1.20
ledged
1.16
how
1.06
firsthand
1.03
beforehand
0.98
lege
0.97
instinctively
0.85
ingly
0.82
how
0.82
whats
0.81
Activations Density 1.326%