INDEX
Explanations
references to the concept of making something better or more meaningful
Following "it" to describe actions or states
make it appear
Negative Logits
secours         -0.66
pleaſure        -0.65
myſelf          -0.60
poffe           -0.57
Према           -0.56
polaire         -0.56
alſo            -0.56
againſt         -0.56
الإنجليزية      -0.56
themſelves      -0.55
Positive Logits
happen          0.88
appear          0.87
seem            0.86
look            0.86
easier          0.79
feel            0.79
available       0.77
accessible      0.75
possible        0.73
aware           0.69
Activations Density 0.138%