INDEX
Explanations
conversations in a customer service or technical support setting
New Auto-Interp
Negative Logits
Kaf
-0.72
Downs
-0.69
Kits
-0.65
ham
-0.65
......
-0.64
Austral
-0.64
[&
-0.64
Hut
-0.63
Pigs
-0.62
di
-0.62
POSITIVE LOGITS
ellectual
0.90
yright
0.90
arted
0.86
romeda
0.84
emption
0.83
resa
0.81
hing
0.81
etition
0.81
rust
0.80
amina
0.79
Activations Density 0.117%