Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement] TextSplitter: return chunk start and end position #957

Open
chuang8511 opened this issue Jul 17, 2024 · 0 comments
Open

[Improvement] TextSplitter: return chunk start and end position #957

chuang8511 opened this issue Jul 17, 2024 · 0 comments

Comments

@chuang8511
Copy link

Background

According to LangChain return sources function, we can know letting users know the source is important.

The UX Part to be Improved

However, by the current way, it only returns the raw chunk text rather than pointing out the position from the raw document.
When the users want to check the raw chunk of the raw document, it can not be achieved by the way LangChain provides.

Solution

  1. Return start & end position for each chunk when the RAG system providers splits the text.
  2. The RAG system providers can use this start & end position to trace back.

Further information

Currently, I am working on another project to achieve this system.
Because I refer to LangChainGo a lot, I think it will be also helpful for the users in this domain.

If you think it is good, I will take time to integrate the logic into langchaingo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant