Added YouTube Transcript RAG pipeline using Haystack. #291
I think it would be more in keeping with the Haystack way to create a custom component that fetches YouTube transcripts, so that we can then build a pipeline around it.
It would also be easier for Haystack users to follow if we just created a pipeline.
Hi @kacperlukawski, thanks for the detailed feedback!
I understand the changes needed:
- Wrap the transcript fetching in a custom Haystack component
- Rebuild the pipelines using Haystack's Pipeline class
I'll work on these changes and update the PR shortly.
@Sanjjjayyy Exactly. I think these changes would align better with Haystack best practices.
Description
Added a new cookbook notebook that builds a RAG pipeline over any
YouTube video transcript using Haystack and the free Hugging Face Inference API.
What this cookbook covers