INDEX
Explanations
phrases related to personal experiences and reflections
New Auto-Interp
Negative Logits
.","
-0.76
ÂŃ
-0.75
jointly
-0.74
concess
-0.73
lawfully
-0.70
thereto
-0.69
multim
-0.68
practicable
-0.68
locally
-0.68
biod
-0.67
POSITIVE LOGITS
fuck
0.92
laughs
0.90
Anyway
0.90
yrics
0.90
yeah
0.87
Shit
0.85
eeee
0.85
Laughs
0.84
fuck
0.84
laughter
0.84
Activations Density 0.824%