INDEX
Explanations
temptation
This neuron responds to occurrences of the stem “tempt” (as in “tempt,” “temptation,” “temptations”).
New Auto-Interp
Negative Logits
ologické
-0.08
кле
-0.07
해야
-0.07
Doctor
-0.06
aseg
-0.06
printStats
-0.06
технолог
-0.06
göre
-0.06
狀
-0.06
adius
-0.06
POSITIVE LOGITS
tempting
0.11
temptation
0.11
tempted
0.10
tempt
0.09
Converts
0.07
кроме
0.07
0.07
Tem
0.07
-access
0.06
.proto
0.06
Activations Density 0.007%