chunk_enrichment.yaml 1.6 KB

chunk_enrichment:
  template: >
    ## Task:
    Enrich and refine the given chunk of text while maintaining its independence and precision.

    ## Context:
    Document Summary: {document_summary}
    Preceding Chunks: {preceding_chunks}
    Succeeding Chunks: {succeeding_chunks}

    ## Input Chunk:
    {chunk}

    ## Semantic Organization Guidelines:
    1. Group related information:
       - Combine logically connected data points
       - Maintain context within each grouping
       - Preserve relationships between entities

    2. Structure hierarchy:
       - Organize from general to specific
       - Use clear categorical divisions
       - Maintain parent-child relationships

    3. Information density:
       - Balance completeness with clarity
       - Ensure each chunk can stand alone
       - Preserve essential context

    4. Pattern recognition:
       - Standardize similar information
       - Use consistent formatting for similar data types
       - It is appropriate to restructure tables or lists in ways that are more advantageous for semantic matching
       - Maintain searchable patterns

    ## Output Requirements:
    1. Each chunk should be independently meaningful
    2. Related information should stay together
    3. Format should support efficient matching
    4. Original data relationships must be preserved
    5. Context should be clear without external references

    Maximum length: {chunk_size} characters

    Output the restructured chunk only.

    ## Restructured Chunk:

  input_types:
    document_summary: str
    chunk: str
    preceding_chunks: str
    succeeding_chunks: str
    chunk_size: int

  overwrite_on_diff: true
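
The `input_types` mapping above suggests that each `{placeholder}` in the template is filled from a typed input. As a minimal sketch of how such a config might be consumed, the Python below checks the supplied values against `input_types` and then fills the template with `str.format`. Everything here is an assumption for illustration: `CONFIG` is a shortened stand-in for the parsed YAML (a real loader, for example PyYAML's `yaml.safe_load`, would produce a similar nested dict), and `TYPE_MAP` and `render_prompt` are hypothetical names, not part of the original project.

```python
# Shortened, hypothetical stand-in for the parsed chunk_enrichment config.
CONFIG = {
    "chunk_enrichment": {
        "template": (
            "Document Summary: {document_summary}\n"
            "Preceding Chunks: {preceding_chunks}\n"
            "Succeeding Chunks: {succeeding_chunks}\n"
            "Input Chunk: {chunk}\n"
            "Maximum length: {chunk_size} characters\n"
        ),
        "input_types": {
            "document_summary": "str",
            "chunk": "str",
            "preceding_chunks": "str",
            "succeeding_chunks": "str",
            "chunk_size": "int",
        },
    }
}

# Assumed mapping from the type names used under input_types to Python types.
TYPE_MAP = {"str": str, "int": int}


def render_prompt(config: dict, name: str, **inputs) -> str:
    """Check inputs against input_types, then fill the template placeholders."""
    entry = config[name]
    declared = entry["input_types"]

    # Every declared input must be supplied, and nothing undeclared is accepted.
    missing = set(declared) - set(inputs)
    extra = set(inputs) - set(declared)
    if missing or extra:
        raise ValueError(f"missing={sorted(missing)} unexpected={sorted(extra)}")

    # Reject values whose Python type does not match the declared type name.
    for key, type_name in declared.items():
        if not isinstance(inputs[key], TYPE_MAP[type_name]):
            raise TypeError(f"{key} must be {type_name}")

    return entry["template"].format(**inputs)


if __name__ == "__main__":
    print(
        render_prompt(
            CONFIG,
            "chunk_enrichment",
            document_summary="Quarterly sales report",
            chunk="Region A revenue rose in Q2.",
            preceding_chunks="",
            succeeding_chunks="",
            chunk_size=500,
        )
    )
```

Validating before formatting means a missing or mistyped input fails with a clear error rather than producing a prompt with a wrong value. Note that `str.format` would treat any literal braces in a template as placeholders; the template above contains none outside its five inputs.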