Could anyone point me towards a relatively beginner-friendly guide to do something like

>download all my tweets (about 20k) and build a semantic searcher on top ?

How can utilize 3rd party embeddings with OpenAI's LLM API? Am I correct to understand from this article that this is possible?

langchain and llama-index are two big opensource projects which are great for buildign this type of thing.

https://github.com/mayooear/gpt4-pdf-chatbot-langchain for example