INDEX
Explanations
personal interactions and conversations
expressions of personal reflection and experiences
Negative Logits
purportedly    -0.49
substantially  -0.48
markedly       -0.47
multinational  -0.44
CLR            -0.44
Analysis       -0.43
cour           -0.43
quartered      -0.42
sharply        -0.42
pmwiki         -0.42
Positive Logits
fuckin    0.64
gonna     0.62
laughs    0.60
fucking   0.59
wanna     0.55
gigg      0.52
guys      0.52
laughing  0.51
kinda     0.51
feeling   0.51
Activations Density 4.625%