Week 01 Journal
- Learning how to fine-tune a BERT model with PyTorch, following the Hugging Face examples.
    - Sentiment Classification: transformers_sentiment_wandb.ipynb
    - Multiclass Classification: transformers_multiclass_classification.ipynb
- Planning to leverage the power of Keras 3, as described in its docs.
    - It’s hard (or not possible) because HF’s `transformers` library doesn’t support it, but I have checked that the `keras_hub` library has its own BERT model, which I think I can use.
- Will explore more next week:
    - How to fine-tune BERT using PyTorch + HF Transformers?
    - How to fine-tune BERT using TensorFlow + HF Transformers?
    - How to fine-tune BERT using Keras 3 + PyTorch + HF Transformers only? (NOT POSSIBLE)
    - What do terms like SDPA, PEFT, LoRA and others mean in the context of fine-tuning?
Week 02 Journal
- Explored a new method to fine-tune a BERT model using the `transformers.TrainingArguments` and `transformers.Trainer` classes.
- Wrote a ref page listing important articles on fine-tuning BERT.
- Committing all the fine-tuning code as `python` scripts and `marimo` notebooks in the `@arv-anshul/notebooks` repo.
- Explored PEFT methods like LoRA and tried fine-tuning a BERT model with them.
- Found a way to format my `mkdocs` docs. See the `@astral-sh/ruff` repo to learn more about formatting `mkdocs` docs.
- Planning to migrate all `/ref` pages to `/blog` because the two aren’t really different. Also, I don’t write blogs because I feel the articles are a better fit for `/ref` pages, which has reduced the usage of `/blog` pages.
    - The migration may have broken some links, which I will fix later.
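
A quick note on why LoRA counts as "parameter-efficient": instead of updating a full weight matrix `W`, it trains two small matrices `A` and `B` and adds `(alpha / rank) * B @ A` on top of the frozen `W`. The sketch below is plain-Python arithmetic to illustrate the idea, not the `peft` library API. The function names, the rank of 8, and the `alpha` scaling value are assumptions picked for illustration; 768 is the hidden size of BERT-base.

```python
# Illustrative sketch only: plain-Python LoRA arithmetic, NOT the `peft` API.
# Helper names, rank=8 and alpha are hypothetical choices for the example.


def lora_param_counts(d_in: int, d_out: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable-parameter counts for one weight matrix."""
    full = d_in * d_out            # updating W (d_out x d_in) directly
    lora = rank * (d_in + d_out)   # A is (rank x d_in), B is (d_out x rank)
    return full, lora


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * vi for m, vi in zip(row, v)) for row in M]


def lora_forward(x, W, A, B, alpha, rank):
    """Compute y = W x + (alpha / rank) * B (A x) for one input vector x."""
    base = matvec(W, x)               # frozen pretrained path
    update = matvec(B, matvec(A, x))  # low-rank trainable path
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


# One 768 x 768 projection (BERT-base hidden size) with an assumed rank of 8.
full, lora = lora_param_counts(768, 768, rank=8)
print(full, lora)  # 589824 12288
```

With `B` initialised to zeros, which is the usual starting point for LoRA, `lora_forward` returns exactly `W x`, so fine-tuning starts from the unchanged pretrained model.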
Week 03 Journal
- Finally, committed the Journal Summarizer as a marimo app for this diary repo.
- :star: Migrated `/ref` pages to `/blog` on the website.
- Also tried formatting the markdown files in the website’s repo using mdformat-mkdocs, but the result isn’t acceptable to me.
    - Raised an issue in the mdformat-mkdocs repo.
- Worked on creating my resume (CV) using LaTeX (`.tex` format), and tried to manage it with a Python script.
Week 04 Journal
- More work on resume (CV) management. Explored LaTeX and tools like `pdflatex` for resume management.
- New repo @arv-anshul/hockey. Scrapes hockey-related data from altiusrt.com websites using the scrapy framework and publishes the data on Kaggle.
- Wrote a Kaggle notebook on sentiment classification of a Reddit comments dataset using Keras.
- Exploring `deepseek-r1` distill models with `smolagents` for NLP-to-SQL tasks.
- Hectic exploration of LaTeX tooling with Podman (literally, very hectic).
Week 05 Journal
- Created @Suraj’s college PPT on Introduction to Systems of Human Body at his place.
- New repo @arv-anshul/resume.
- @Nitish sir (@CampusX) announced Gen AI Course for free on YouTube.
- Joined Kaggle Competition S05E01.
- Feeling clueless about the next step in anything (project, learning).