Explanations
phrases with the word "get"
the presence of the word "get" and its variations
Negative Logits
token                   logit
palate                  -0.63
*/                      -0.61
barring                 -0.60
anecd                   -0.60
Photographer            -0.56
surv                    -0.55
taboo                   -0.54
contemplation           -0.54
deliberations           -0.54
experiment              -0.53
Positive Logits
token                   logit
rid                     1.28
TING                    1.19
cloneembedreportprint   1.07
away                    1.02
tin                     0.96
aways                   0.94
ters                    0.92
ãĥ³ãĤ¸                  0.92
bent                    0.89
chell                   0.87
Activations Density: 0.070%