INDEX
Explanations
bacterial defense mechanism
New Auto-Interp
Negative Logits
ut
0.54
ach
0.50
MP
0.48
MP
0.48
ri
0.47
are
0.46
ப்பட்டுள்ளது
0.46
tir
0.46
r
0.46
ip
0.46
POSITIVE LOGITS
dysfunctional
0.53
nearby
0.48
poorly
0.47
semisimple
0.46
করতে
0.45
farmhouse
0.45
الية
0.43
fueling
0.43
prefixed
0.43
deficient
0.42
Activations Density 0.003%