INDEX
Explanations
words related to being worried or troubled about something
expressions of concern
Negative Logits
iller      -0.78
arb        -0.75
aro        -0.72
avour      -0.69
alter      -0.67
avorite    -0.66
OVA        -0.64
oba        -0.62
ingers     -0.62
heres      -0.61
Positive Logits
about      1.32
ABOUT      1.12
about      1.08
lest       0.97
About      0.93
ingly      0.92
lessly     0.82
About      0.81
enough     0.80
regarding  0.79
Activations Density 0.053%