INDEX
Explanations
instances of gratitude or appreciation
Negative Logits
ucci     -0.15
ombok    -0.15
BOUND    -0.14
VIA      -0.13
Wash     -0.13
wash     -0.13
ucha     -0.13
oloj     -0.13
Aw       -0.12
bakan    -0.12
Positive Logits
BT          0.73
BT          0.65
btw         0.63
incident    0.59
by          0.52
bt          0.49
Incident    0.47
incident    0.47
INCIDENT    0.42
By          0.39
Activations Density: 0.116%