INDEX
Explanations
occurrences of the word "trick" in various contexts
New Auto-Interp
Negative Logits
pleaſure
-0.90
houſe
-0.87
Houſe
-0.86
ſta
-0.85
ſte
-0.82
itſelf
-0.79
myſelf
-0.78
raiſ
-0.77
ſever
-0.69
ſelf
-0.69
POSITIVE LOGITS
multiply
0.49
Multiply
0.47
multiply
0.45
trick
0.43
trick
0.42
kiin
0.40
Multiply
0.39
excelencia
0.38
multiplied
0.38
rokok
0.38
Activations Density 0.225%