INDEX
Explanations
instances of the word "just" used to downplay significance or deliver information
the word "just" in various contexts
New Auto-Interp
Negative Logits
seiz
-0.73
undai
-0.71
necks
-0.71
challeng
-0.68
ught
-0.67
destro
-0.67
sacrific
-0.65
pora
-0.65
glomer
-0.64
antis
-0.62
POSITIVE LOGITS
ifiable
1.28
ifications
1.04
kidding
0.94
IFIED
0.93
if
0.89
plain
0.87
ified
0.86
ices
0.86
shy
0.80
itia
0.76
Activations Density 0.067%