INDEX
Explanations
phrases related to thoroughness or detailed investigations
phrases indicating comprehensiveness or careful examination
New Auto-Interp
Negative Logits
AIN
-0.71
Tonight
-0.68
Rebels
-0.68
qqa
-0.66
dust
-0.64
nesota
-0.62
birds
-0.62
CHAT
-0.62
agne
-0.61
ospels
-0.61
POSITIVE LOGITS
bred
1.40
fare
1.04
ness
0.99
going
0.94
thorough
0.93
examination
0.84
examinations
0.80
spection
0.79
done
0.76
spect
0.75
Activations Density 0.022%