Default PyTorch `Linear` initializes the weights, which is useless and slow.

| Name |
|---|
| diffusionmodules |
| distributions |
| encoders |
| attention.py |
| ema.py |
| sub_quadratic_attention.py |
| tomesd.py |
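
The note about `Linear` at the top of this listing can be illustrated with a minimal sketch. It assumes the weights are overwritten from a checkpoint right after construction, which is what would make the default random initialization wasted work. The class name `LinearNoInit` and the toy state dict are hypothetical and are not taken from the files listed here.

```python
import torch
import torch.nn as nn


class LinearNoInit(nn.Linear):
    """nn.Linear that skips the default random initialization (hypothetical name)."""

    def reset_parameters(self):
        # nn.Linear.__init__ allocates weight and bias, then calls
        # reset_parameters() to fill them with random values. Returning early
        # leaves the tensors allocated but uninitialized, so they must be
        # overwritten before the layer is used.
        return None


layer = LinearNoInit(4, 2)

# Stand-in for loading a checkpoint: the real values come from a state dict.
state = {"weight": torch.ones(2, 4), "bias": torch.zeros(2)}
layer.load_state_dict(state)

out = layer(torch.ones(1, 4))
```

A layer built this way holds arbitrary memory contents until `load_state_dict` runs, so it only makes sense when every parameter is guaranteed to be loaded. PyTorch also provides `torch.nn.utils.skip_init` for constructing a module without running its initialization, which may be a better fit when subclassing is not wanted.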