Small-Reason-8096 t1_irzvwc8 wrote on October 12, 2022 at 8:01 AM

Reply to comment by TheInfelicitousDandy in [D] Recent ML papers to implement from scratch by nullspace1729

That surprises me as there was a good Fastai version:

https://docs.fast.ai/text.models.awdlstm.html

which is built on pytorch. When I played with it ages ago the results seemed comparable to the paper, but I haven't revisited it for a while :)

Small-Reason-8096 t1_irr6g0q wrote on October 10, 2022 at 12:44 PM

Reply to [D] Recent ML papers to implement from scratch by nullspace1729

Hands down the best paper I have ever read (and reimplemented) is the ResNets paper:

https://arxiv.org/abs/1512.03385

The descriptions are clear and concise - but with enough detail to reimplement in whatever framework you like. Also, OOTB the results I got on CIFAR10 matched the paper pretty much perfectly (not always a given!).

Another good paper to try is AWD-LSTM: https://arxiv.org/pdf/1708.02182.pdf

Basically, if you are implementing and training from scratch, focus on something you can train with a smallish dataset in a reasonable period of time. I would generally steer away from LLMs and object detection / segmentation models as they require more resources to train that are commonly available!