INDEX
Explanations
phrases related to casual conversations and interactions
expressions of casual conversation and humor
New Auto-Interp
Negative Logits
AMD
-0.53
agric
-0.52
GPU
-0.51
ordes
-0.51
quartered
-0.48
orsi
-0.47
lapt
-0.47
reliant
-0.47
Firstly
-0.47
products
-0.47
POSITIVE LOGITS
fuckin
0.87
fucking
0.73
uh
0.68
gonna
0.67
bitch
0.65
eeee
0.64
fucked
0.63
wanna
0.61
kinda
0.60
gotta
0.58
Activations Density 1.404%