INDEX
Explanations
expressions related to understanding or getting something
instances of the phrase "get it."
New Auto-Interp
Negative Logits
izable
-0.62
Mans
-0.60
avage
-0.60
hips
-0.58
Friend
-0.57
currently
-0.57
Sund
-0.57
Lar
-0.55
ãĥ©ãĥ³
-0.54
CHO
-0.54
POSITIVE LOGITS
alian
1.19
chy
1.09
unes
0.95
iner
0.91
self
0.91
ueller
0.81
atic
0.79
geist
0.77
asca
0.75
atical
0.70
Activations Density 0.151%