Commit c355157
Staging to Main branch (#1492)
* different url web page having same title issue fixed (#1110)
* Text file encoding issue (#1126)
* Resolved UnicodeDecodeError for some txt files
* changes done to process utf-8 file with unstructured only
* Resolved UnicodeDecodeError issue for files having other than utf-8 encoding (#1128)
* Resolved UnicodeDecodeError for some txt files
* changes done to process utf-8 file with unstructured only
* ensure file path is correct and as per intended format to avoid security risks
* Sanitizing additional instruction (#1130)
* resolved UnboundLocalError: local variable 'graphDb_data_Access' referenced before assignment (#1129)
* connection not there message for data resources (#1131)
* dockerfile updates and utils functions change
* fix: readonly issue fix
* Resolved uploaded file extraction failing on deployed version (#1136)
* UI fixes v0.7.2 (#1138)
* UI fixes
* dropdown option fix
* Message Overlay and button size fixes
* select components size change from large to small post package update (#1135)
* connection status check
* connection check update
* fix
* added dropdown for graph visulization
* fixes: added explore icon fixed responsivness issue
* document_status api fix
---------
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
* Update BreakDownPopOver.tsx
* chunk_count_val
* type error
* spell fixes and protected route fixes
* top entities not found - bug resolved (#1150)
* top entities not fpund bug resolved
* refined code
* limiting content fetching to current wikipedia page (#1151)
* added the link for login redirectig
* removed loading statw
* added the padding and changed the message
* Bump re-resizable from 6.9.17 to 6.11.2 in /frontend (#1149)
Bumps [re-resizable](https://github.com/bokuweb/react-resizable-box) from 6.9.17 to 6.11.2.
- [Release notes](https://github.com/bokuweb/react-resizable-box/releases)
- [Changelog](https://github.com/bokuweb/re-resizable/blob/master/CHANGELOG.md)
- [Commits](https://github.com/bokuweb/react-resizable-box/commits)
---
updated-dependencies:
- dependency-name: re-resizable
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump eslint-plugin-react from 7.35.0 to 7.37.4 in /frontend (#1148)
Bumps [eslint-plugin-react](https://github.com/jsx-eslint/eslint-plugin-react) from 7.35.0 to 7.37.4.
- [Release notes](https://github.com/jsx-eslint/eslint-plugin-react/releases)
- [Changelog](https://github.com/jsx-eslint/eslint-plugin-react/blob/master/CHANGELOG.md)
- [Commits](https://github.com/jsx-eslint/eslint-plugin-react/compare/v7.35.0...v7.37.4)
---
updated-dependencies:
- dependency-name: eslint-plugin-react
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump @types/node from 20.14.14 to 22.13.9 in /frontend (#1152)
Bumps [@types/node](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/node) from 20.14.14 to 22.13.9.
- [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases)
- [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/node)
---
updated-dependencies:
- dependency-name: "@types/node"
dependency-type: direct:development
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump eslint-config-prettier from 8.10.0 to 10.0.2 in /frontend (#1146)
Bumps [eslint-config-prettier](https://github.com/prettier/eslint-config-prettier) from 8.10.0 to 10.0.2.
- [Release notes](https://github.com/prettier/eslint-config-prettier/releases)
- [Changelog](https://github.com/prettier/eslint-config-prettier/blob/main/CHANGELOG.md)
- [Commits](https://github.com/prettier/eslint-config-prettier/compare/v8.10.0...v10.0.2)
---
updated-dependencies:
- dependency-name: eslint-config-prettier
dependency-type: direct:development
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump react-dropzone from 14.3.5 to 14.3.8 in /frontend (#1145)
Bumps [react-dropzone](https://github.com/react-dropzone/react-dropzone) from 14.3.5 to 14.3.8.
- [Release notes](https://github.com/react-dropzone/react-dropzone/releases)
- [Commits](https://github.com/react-dropzone/react-dropzone/compare/v14.3.5...v14.3.8)
---
updated-dependencies:
- dependency-name: react-dropzone
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update dependabot.yml
* Update the query to check DB is gds version (#1153)
* Entity details shown for entity mode (#1154)
* Dev (#1132)
* Read only mode for unauthenticated users (#1046)
* llm name changes
* build fix
* default mode fix
* ragas model names update
* lint fixes
* Chunk Entities API condition
* added the tooltip for unsupported lllms for ragas metric loading
* removed unused imports
* multimode fix when we get error response
* mode changes for score display
* fix: Fixed the details state handling between multiple chats
feature: Added the warning banner If selected llm model is not supported for raga's evaluation
* Fix: Entity Mode Width Fix
* diffbot fix for async (#797)
* Minor changes (#798)
* added congig variable for default diffbot chat model
* fulltext index creation is skipped when the labels are empty
* entity vector change
* added optinal to communities for entity mode
* updated the entity query
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* New: Added the supported llm models for ragas evaluation
* Fix: Communitites Tab is displayed based communitites length
* added the conversation download button (#800)
* model name correction
* chatmode switch mode fix
* Add API payload GCP logging (#805)
* Adding Links to get neighboring nodes (#796)
* addition of link
* added neighbours query
* implemented with driver
* updated the query
* communitiesInfo name change
* communities.tsx removed
* api integration
* modified response
* entities change
* chunk and communities
* chunk space removal
* added element id to chunks
* loading on click
* format changes
* added file name for Dcoumrnt node
* chat token cut off model name update
* icon change
* duplicate sources removal
* Entity change
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
* added error message for doc retriver (#807)
* copy row (#803)
* copy row
* column for copy
* column copy
* Raga's Evaluation For Multi Modes (#806)
* Updatedmodels for ragas eval
* context utilization metrics removed
* updated supported llms for ragas
* removed context utilization
* Implemented Parallel API
* multi api calls error resolved
* MultiMode Metrics
* Fix: Metric Evalution For Single Mode
* multi modes ragas evaluation
* api payload changes
* metric api output format changed
* multi mode ragas changes
* removed pre process dataset
* api response changes
* Multimode metrics api integration
* nan error for no answer resolved
* QA integration changes
---------
Co-authored-by: kaustubh-darekar <kaustubh_darekar@persistent.com>
* lint fixes
* fix: multimode metrics state handling
fix: lint fixes
* fix: Multimode metrics mode change state issue
fix: chunk list style issue
* fix: list style fix
* Correct TYPO mistake
* added new env for ragas embedding model
* Props name changes (#811)
* Props name changes
* removed the accesstoken from row on copy action
* props changes for dropzone component
* graph view changes
---------
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
* test
* view graph
* nodes count and relationshipcount updation fix
* sourceUrl Fix
* empty string "" fix to keep the default values we should keep the value blank instead ""
* prop changes
* props changes
* retry condition update for failed files (#820)
* Chat modes name changes (#815)
* Props name changes
* removed the accesstoken from row on copy action
* updated chat mode names
* Chat Modes Name Changes
* lint fixes
* using readble format In UI
* removal of size to avoid console warning
* key add
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
* Youtube transcript fix with proxy (#822)
* update script for async func
* ragas changes for graph retrieval mode. context added in api output (#825)
* Remove extract latency from logging and add LIMIT in duplicate nodes
* Document updates (#828)
* document updated with ragas evaluation information
* formatting changes
* chatbot api documentation updated
* api details added in document
* function name changed for drop create vector index api
* Update README.md
* updated api structire in docs (#827)
* Update backend_docs.adoc
* 821 llm model listing (#823)
* added logic for document filters
* LLM models
* message change
* link added
* removed the text
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
* Exclude session lable node from duplicate nodes list
* Added the tooltip for disabled llm option (#835)
* node size changes
* mode removal of rows check
* formatting
* Exclude __Entity__ node label from duplicate node list
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* fixed the youtube link
* Security header and GZIPMiddleware (#847)
* Added security header all API
* Add GZipMiddleware
* Chunk Text Details (#850)
* Community title added
* Added api for fetching chunk text details
* output format changed for chunk text
* integrated the service layer for chunkdata
* added the chunks
* formatting output of llm call for title generation
* formatting llm output for title generation
* added flex row
* Changes related to pagination of fetch chunk api
* Integrated the pagination
* page changes error resolved for fetch chunk api
* for get neighbours api , community title added in properties
* moving community title related changes to separate branch
* Removed Query module from fastapi import statement
* icon changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Communities Id to Title (#851)
* Staging to main (#735)
* Dev (#537)
* format fixes and graph schema indication fix
* Update README.md
* added chat modes variable in env updated the readme
* spell fix
* added the chat mode in env table
* added the logos
* fixed the overflow issues
* removed the extra fix
* Fixed specific scenario "when the text from schema closes it should reopen the previous modal"
* readme changes
* removed dev console logs
* added new retrieval query (#533)
* format fixes and tab rendering fix
* fixed the setting modal reopen issue
---------
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
* disabled the sumbit buttom on loading
* Deduplication tab (#566)
* de-duplication API
* Update De-Duplicate query
* created the Deduplication tab
* added the API service
* added the removeable tags for similar nodes in deduplication tab
* Integrate Tag
* added GraphLabel
* added loader state
* added the merge service
* integrated the merge API
* Merge Query issue fixed
* Auto refresh the duplicate nodes after merging operation
* added the description for de duplication
* reset on merging
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Update frontend_docs.adoc (#538)
* Update frontend_docs.adoc
* doc update
* Images
* Images folder change
* Images folder change
* test image
* Update frontend_docs.adoc
* image change
* Update frontend_docs.adoc
* Update frontend_docs.adoc
* added the Graph Mode SS
* added the Query SS
* Update frontend_docs.adoc
* conflics fix
* conflict fix
* Update frontend_docs.adoc
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* updated langchain versions (#565)
* Update the De-Duplication query
* Node relationship id type none issue (#547)
* de-duplication API
* Update De-Duplicate query
* Issue fixed Nodes,Relationship Id and Type None or Blank
* added the tooltips
* type fix
* Unneccory import
* added score threshold and added some error handling (#571)
* Update requirements.txt
* Tooltip and other UI fixes (#572)
* Staging To Main (#495)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* Dev (#433)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
* DEV to STAGING (#461)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* DEV to STAGING (#462)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* added upload api
* changed the dropzone error message
* Dev to staging (#466)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
…1 parent 2c0421d commit c355157
56 files changed
Lines changed: 2259 additions & 725 deletions
File tree
- backend
- src
- entities
- docs
- backend
- frontend
- images
- frontend/src
- components
- ChatBot
- DataSources
- AWS
- GCS
- Local
- Graph
- Layout
- Popups
- ConnectionModal
- EmbeddingDimensionWarningModal
- GraphEnhancementDialog
- AdditionalInstructions
- EnitityExtraction
- RetryConfirmation
- UI
- User
- context
- hooks
- services
- utils
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
Large diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
64 | 64 | | |
65 | 65 | | |
66 | 66 | | |
67 | | - | |
68 | | - | |
69 | 67 | | |
70 | 68 | | |
71 | 69 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | | - | |
3 | | - | |
| 2 | + | |
| 3 | + | |
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
| 23 | + | |
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | | - | |
52 | | - | |
53 | | - | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
Large diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | | - | |
43 | | - | |
44 | 42 | | |
45 | 43 | | |
46 | 44 | | |
| |||
291 | 289 | | |
292 | 290 | | |
293 | 291 | | |
294 | | - | |
| 292 | + | |
295 | 293 | | |
296 | 294 | | |
297 | 295 | | |
| |||
305 | 303 | | |
306 | 304 | | |
307 | 305 | | |
308 | | - | |
| 306 | + | |
309 | 307 | | |
310 | 308 | | |
311 | 309 | | |
| |||
334 | 332 | | |
335 | 333 | | |
336 | 334 | | |
337 | | - | |
| 335 | + | |
338 | 336 | | |
339 | 337 | | |
340 | 338 | | |
| |||
346 | 344 | | |
347 | 345 | | |
348 | 346 | | |
349 | | - | |
| 347 | + | |
350 | 348 | | |
351 | 349 | | |
352 | 350 | | |
| |||
397 | 395 | | |
398 | 396 | | |
399 | 397 | | |
400 | | - | |
| 398 | + | |
401 | 399 | | |
402 | 400 | | |
403 | | - | |
| 401 | + | |
404 | 402 | | |
405 | 403 | | |
406 | 404 | | |
| |||
412 | 410 | | |
413 | 411 | | |
414 | 412 | | |
415 | | - | |
| 413 | + | |
416 | 414 | | |
417 | 415 | | |
418 | 416 | | |
| |||
421 | 419 | | |
422 | 420 | | |
423 | 421 | | |
424 | | - | |
425 | | - | |
| 422 | + | |
| 423 | + | |
426 | 424 | | |
427 | 425 | | |
428 | 426 | | |
| |||
433 | 431 | | |
434 | 432 | | |
435 | 433 | | |
436 | | - | |
| 434 | + | |
437 | 435 | | |
438 | | - | |
439 | | - | |
440 | | - | |
441 | | - | |
442 | | - | |
443 | | - | |
444 | | - | |
| 436 | + | |
| 437 | + | |
| 438 | + | |
| 439 | + | |
| 440 | + | |
| 441 | + | |
| 442 | + | |
445 | 443 | | |
446 | 444 | | |
447 | 445 | | |
448 | 446 | | |
449 | 447 | | |
450 | | - | |
451 | | - | |
452 | | - | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
453 | 451 | | |
454 | 452 | | |
455 | 453 | | |
| |||
664 | 662 | | |
665 | 663 | | |
666 | 664 | | |
667 | | - | |
| 665 | + | |
668 | 666 | | |
669 | 667 | | |
670 | 668 | | |
| |||
695 | 693 | | |
696 | 694 | | |
697 | 695 | | |
698 | | - | |
| 696 | + | |
699 | 697 | | |
700 | 698 | | |
701 | 699 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
316 | 316 | | |
317 | 317 | | |
318 | 318 | | |
319 | | - | |
| 319 | + | |
320 | 320 | | |
321 | | - | |
| 321 | + | |
322 | 322 | | |
323 | 323 | | |
324 | 324 | | |
| |||
361 | 361 | | |
362 | 362 | | |
363 | 363 | | |
364 | | - | |
| 364 | + | |
365 | 365 | | |
366 | 366 | | |
367 | 367 | | |
368 | 368 | | |
369 | 369 | | |
370 | | - | |
| 370 | + | |
371 | 371 | | |
372 | 372 | | |
373 | 373 | | |
374 | 374 | | |
375 | | - | |
| 375 | + | |
376 | 376 | | |
377 | | - | |
378 | | - | |
| 377 | + | |
379 | 378 | | |
380 | 379 | | |
381 | 380 | | |
| |||
467 | 466 | | |
468 | 467 | | |
469 | 468 | | |
470 | | - | |
| 469 | + | |
471 | 470 | | |
472 | 471 | | |
473 | 472 | | |
| |||
484 | 483 | | |
485 | 484 | | |
486 | 485 | | |
487 | | - | |
| 486 | + | |
488 | 487 | | |
489 | 488 | | |
490 | 489 | | |
| |||
518 | 517 | | |
519 | 518 | | |
520 | 519 | | |
521 | | - | |
| 520 | + | |
522 | 521 | | |
523 | 522 | | |
524 | 523 | | |
| |||
527 | 526 | | |
528 | 527 | | |
529 | 528 | | |
530 | | - | |
| 529 | + | |
531 | 530 | | |
532 | 531 | | |
533 | 532 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
| 27 | + | |
27 | 28 | | |
28 | 29 | | |
29 | 30 | | |
| |||
46 | 47 | | |
47 | 48 | | |
48 | 49 | | |
| 50 | + | |
| 51 | + | |
49 | 52 | | |
50 | 53 | | |
51 | 54 | | |
| |||
68 | 71 | | |
69 | 72 | | |
70 | 73 | | |
| 74 | + | |
| 75 | + | |
71 | 76 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| 34 | + | |
0 commit comments