It would be extremely helpful to have this simple automation action:
- Extract raw text from attached file
It would be extremely helpful to have this simple automation action:
What file types did you have in mind?
(I donāt think you would want to extract raw text from an attached .jpg file )
Up to now, external content as well as code snippets I need to insert in entities their rich text field. This is a big problem, because:
We need:
This serves two perposes:
Markdown (.md)
ā Structured knowledge blocks
ā Semantic entity conversion
Plain Text (.txt)
ā Logs or raw transcripts
ā Unstructured input for NLP
Microsoft Word (.docx)
ā Meeting notes
ā Headings and tables
PDF (text-based)
ā Reports or whitepapers
ā Legal or archival records
CSV (.csv)
ā Tabular task or metric data
ā Rows mapped to entities
JSON (.json)
ā Config or schema files
ā Structured database population
YAML (.yaml/.yml)
ā Workflow or deployment configs
ā Nested structure definitions
HTML/XHTML (.html/.xhtml)
ā Static documentation pages
ā Article bodies or metadata
Code Files (.js, .py, .sh, etc.)
ā Logic or automation scripts
ā Snippets linked to functions
Rich Text Format (.rtf)
ā Formatted notes
ā Legacy content migration
Perhaps you could automate sending your uploaded (attached to entity) files to an external service that would do the needed extraction/conversion, then store the result into another field/entity.
File API
I understand that, but given that Fibery is such a workflow-centric platform, Iām pointing to a gap in what many users would reasonably expect out of the box.
The ability to extract and work with the raw text of attached filesāespecially for content like transcripts, scripts, and codeāis a foundational capability for many knowledge and automation workflows.
Even as custom integration I would not like it, for something so basic it adds significant friction to an internal process experience.
A counter-argument might be that ādocument conversionā is too broad and deep a function to be a core feature (with many differing needs), and thus it would make sense to use an integration (of some sort) to outsource this to a dedicated service, so it can be better customized for each use case.
But it would be great if implementing such an integration was simpler!
I donāt mean to sound churlish, but although it may feel like āsomething so basicā to you, I donāt imagine it is a part of many peopleās knowledge workflows, let alone āa foundational capabilityā.
Iād even go as far as to say that Fibery is a ādata-centricā platform, rather than a 'workflow-centricā platform.
Anyway, Iām willing to be proven wrong - letās see what the votes say.
Iād say Fibery is an āinsight-centricā platform, or its moving towards being that, seen the developments in the last year.
If Fibery becomes the AI-co-operating system of an organization, then its insight-generating workflows based. @mdubakov ?