Tel Aviv-based startup Factify emerged from stealth today with a $73 million seed round for a bold but quixotic mission: to bring digital documents beyond the standard formats most companies use (.PDF, .docx, and collaborative cloud files like Google Docs) and into the intelligence era.
For Matan Gavish, Factify’s founder and CEO, this is not just a software upgrade; it is an inevitability he has been obsessed with for years.
"The PDF was developed when I was in elementary school," Gavish told VentureBeat. "The bedrock of the software ecosystem hasn't really evolved… someone has to redesign the digital document itself."
Gavish, a tenured professor of computer science with a Stanford PhD, admits that his fixation on administrative file formats is an anomaly for someone with his credentials.
"It's a very uncool problem to be obsessed with," he says. "Given the fact that my academic background is AI and machine learning, my mom wanted me to start an AI company because it's cool. I'm not sure why I'm obsessed and then possessed by documents."
But that obsession has now attracted a sizeable seed round led by Valley Capital Partners and backed by AI heavyweights like former Google AI chief John Giannandrea.
The bet is simple: the static rigidity of most digital files has limited their utility, and a better, more intelligent document, one that actually shares its edit history and ownership with users as intended, is not only possible but a multi-billion-dollar opportunity.
The history of digital documents
To understand why a seed round would balloon to $73 million, you have to understand the scale of the trap businesses are in. There are currently an estimated three trillion PDFs in circulation. "Some people see the PDF more than they see their kids," Gavish jokes.
The history of the digital document is not a linear progression where one format replaces another. Instead, it’s a story of "speciation," where different formats evolved to fill distinct ecological niches: creation, distribution, and collaboration.
The era of files: Microsoft Word (1980s–1990s)
Digital documents began as isolated artifacts. In the 1980s, the "document" was inextricably linked to the hardware that created it. A file created in WordPerfect on a DOS machine was effectively gibberish to a Macintosh user.
Microsoft Word, which traces its lineage to the pioneering WYSIWYG editors at Xerox PARC, changed this by leveraging the dominance of the Windows operating system. By the 1990s, the binary .doc format had become the default container for editable professional documents. However, these files were structurally complex "memory dumps" designed for the limited hardware of the time, often leading to corruption or privacy leaks where deleted text remained hidden in the file's binary data.
The era of digital 'stone': the PDF (1990s–2006)
The PDF didn’t originate as a tool for writing; it was a tool for viewing. In 1991, Adobe co-founder John Warnock penned the "Camelot Project" white paper, envisioning a "digital envelope" that would look identical on any display or printer.
Unlike Word files, which were malleable, PDFs were designed to be immutable. They used the PostScript imaging model to place characters at precise coordinates, guaranteeing visual fidelity. While adoption was initially slow, Adobe’s 1994 decision to release the Acrobat Reader for free established PDF as the global standard for "digital concrete": the format of finality used for contracts, government forms, and archives.
The collaborative cloud docs era (2006–present)
In 2006, Google disrupted the model again by moving the document from the hard drive to the browser. Using "Operational Transformation" algorithms, Google Docs allowed multiple users to edit the same stream of text simultaneously.
This shifted the paradigm from "sending a file" to "sharing a link." Google Workspace now claims over 3 billion users (largely consumers and education), and it fundamentally changed how we work, turning documents into living, collaborative processes rather than static artifacts.
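For readers unfamiliar with the "Operational Transformation" idea mentioned above, the following is a minimal sketch of its core trick for the simplest case: two editors inserting text at the same time. It is an illustration only, not Google's implementation. The `Insert` type, the `site` tie-breaker, and the function names are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    pos: int    # index in the text where the string is inserted
    text: str   # string to insert
    site: int   # hypothetical editor id, used only to break ties


def apply_op(doc: str, op: Insert) -> str:
    """Apply a single insert to a document string."""
    return doc[:op.pos] + op.text + doc[op.pos:]


def transform(op: Insert, other: Insert) -> Insert:
    """Rewrite `op` so it is valid after `other` has already been applied."""
    # If `other` landed earlier in the text (or at the same spot and wins the
    # tie), everything `op` refers to has shifted right by len(other.text).
    if other.pos < op.pos or (other.pos == op.pos and other.site < op.site):
        return Insert(op.pos + len(other.text), op.text, op.site)
    return op


base = "Hello world"
a = Insert(5, ",", site=1)    # editor 1 adds a comma after "Hello"
b = Insert(11, "!", site=2)   # editor 2 appends an exclamation mark

# Each editor applies its own edit first, then the transformed remote edit.
via_a = apply_op(apply_op(base, a), transform(b, a))
via_b = apply_op(apply_op(base, b), transform(a, b))
assert via_a == via_b  # both replicas converge on the same text
```

Production systems also have to handle deletions, longer operation histories, and a server that orders edits, all of which this sketch leaves out.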
The status quo: fragmentation
Despite these advances, the business world remains fragmented. We draft in Google Docs (the "Digital Stream"), format in Word (the "Digital Clay"), and sign in PDF (the "Digital Stone").
But this fragmentation has a cost. "The problem is not the document. It’s everything around it," the company notes. "Once a PDF leaves your system, control is gone. Versions drift. Access is unclear. Nothing is visible."
Turning digital documents into intelligent infrastructure
Factify’s bet is that in the age of AI, this fragmentation is no longer just annoying; it’s a critical failure. AI models need structured, verifiable data to function.
When an AI "reads" a PDF, it’s essentially guessing, using optical character recognition to scrape text from what is effectively a digital photo.
"What we're dealing with here is a megalomaniac vision, but it's at the same time probably something that’s inevitable," Gavish says.
Factify’s solution is to treat documents not as static files, but as intelligent infrastructure. In the "Factified" standard, a document carries its own brain. It possesses a unique identity, a live permission system, and an immutable audit log that travels with it.
"We wrote a new document format that supplants the PostScript," Gavish explains. "We created a new data layer that supports the document as a first-class citizen… and it's always available inside the organization and potentially outside."
This distinction, between a file and an API, is the core of the company's pitch:
- Files are liabilities: They accumulate, get lost, and can be stolen. "It goes back to a brick standing," Gavish says. "Files are liabilities, if anything, because they just accumulate there, you have to guard them."
- APIs are assets: A Factify document is an active object. You can ask it questions: "Who has seen you? When do you expire? Are you the most up-to-date version?"
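To make the "document as an active object" idea concrete, here is a minimal sketch of an object that carries its own identity, permissions, and audit trail, and can answer the three questions quoted above. It does not reflect Factify's actual format or API, which the article does not describe. Every class, field, and method name is hypothetical, and the hash chain is one common way to make tampering with a log detectable, not a claim about how Factify does it.

```python
import hashlib
import json
import uuid
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class ActiveDocument:
    """Hypothetical document object; none of these names come from Factify."""
    body: str
    expires_at: datetime
    doc_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    superseded_by: Optional[str] = None   # id of a newer version, if any
    allowed: set = field(default_factory=set)
    audit_log: list = field(default_factory=list)

    @staticmethod
    def _digest(actor: str, action: str, prev: str) -> str:
        payload = json.dumps({"actor": actor, "action": action, "prev": prev},
                             sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def _record(self, actor: str, action: str) -> None:
        # Each entry stores the previous entry's hash, forming a chain.
        prev = self.audit_log[-1]["hash"] if self.audit_log else ""
        self.audit_log.append({"actor": actor, "action": action, "prev": prev,
                               "hash": self._digest(actor, action, prev)})

    def grant(self, user: str) -> None:
        self.allowed.add(user)
        self._record("owner", f"grant:{user}")

    def view(self, user: str, now: datetime) -> str:
        if user not in self.allowed or now >= self.expires_at:
            self._record(user, "denied")
            raise PermissionError(f"{user} may not view {self.doc_id}")
        self._record(user, "view")
        return self.body

    def who_has_seen(self) -> list:
        return sorted({e["actor"] for e in self.audit_log if e["action"] == "view"})

    def is_latest(self) -> bool:
        return self.superseded_by is None

    def log_is_intact(self) -> bool:
        # Recompute the chain; any edited entry breaks every later hash.
        prev = ""
        for e in self.audit_log:
            if e["prev"] != prev or e["hash"] != self._digest(e["actor"], e["action"], prev):
                return False
            prev = e["hash"]
        return True


memo = ActiveDocument(body="Investment memo",
                      expires_at=datetime(2030, 1, 1, tzinfo=timezone.utc))
memo.grant("alice")
memo.view("alice", datetime(2026, 1, 1, tzinfo=timezone.utc))
print(memo.who_has_seen(), memo.is_latest(), memo.log_is_intact())
```

In this sketch the log is only tamper-evident: editing an entry is possible, but `log_is_intact()` will then return `False`. A real system would also need the permissions and log to be enforced by a service rather than by the object itself.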
'People don't change', but formats do
History is littered with formats that tried to replace the PDF (like Microsoft’s XPS). They failed because they demanded too much behavioral change from users. Gavish is keenly aware of this trap.
"When I talk to enterprise software entrepreneurs, I tell them the two laws to know about starting a company in enterprise software is that people don't care, and no one changes," he says.
To skirt this, Factify has built deep backwards compatibility. A Factified document can look exactly like a PDF, complete with page breaks and margins. Users don't need to learn a new interface to get value; they just need to solve a specific pain point, like an executive who wants to ensure an investment memo can’t be forwarded.
"All they have to tell their team is, 'Dear Chief of Staff, employment agreements and investment memoranda… are going to be Factified. The rest carry on,'" Gavish says. "They see immediate benefit… but then they discover that they've crossed the Rubicon."
What's next for Factify?
The capital from this round will be used to deepen the platform's core engineering, which Gavish describes as a "heavy engineering lift" requiring them to rebuild the document format, data layer, and application layer from scratch. The company is also establishing a major operational hub in Pittsburgh to support its U.S. expansion.
Ultimately, Factify isn't trying to build another collaboration tool like Google Docs. They’re trying to build the immutable record of the future: the standard for "truth" in a digital world.
"The PDF… became a standard meaning I can’t file my taxes using any other format. This is what victory looks like," Gavish says. "We’re creating a document standard that’s not specific for health care or for insurance, but is just document as such."
For the three trillion static files currently sitting in cloud storage, the writing may finally be on the wall.

