    UK Tech Insider

    Factify wants to move past PDFs and .docx by giving digital documents their own brain

    By Sophia Ahmed Wilson | January 29, 2026 | 7 min read

    Tel Aviv-based startup Factify emerged from stealth today with a $73 million seed round for an ambitious, if quixotic, mission: to bring digital documents beyond the standard formats most businesses use (.PDF, .docx, and collaborative cloud files like Google Docs) and into the intelligence era.

    For Matan Gavish, Factify’s founder and CEO, this is not just a software upgrade; it’s an inevitability he has been obsessed with for years.

    "The PDF was developed when I was in elementary school," Gavish told VentureBeat. "The bedrock of the software ecosystem hasn't really evolved… someone has to redesign the digital document itself."

    Gavish, a tenured professor of computer science and Stanford PhD, admits that his fixation on administrative file formats is an anomaly for someone with his credentials.

    "It's a very uncool problem to be obsessed with," he says. "Given the fact that my academic background is AI and machine learning, my mom wanted me to start an AI company because it's cool. I'm not sure why I'm obsessed and then possessed by documents."

    But that obsession has now attracted a sizeable seed round led by Valley Capital Partners and backed by AI heavyweights like former Google AI chief John Giannandrea.

    The bet is simple: the static rigidity of most digital files has limited their utility, and a better, more intelligent document that actually shares its edit history and ownership with users as intended is not only possible, it's a multi-billion-dollar opportunity.

    The history of digital documents

    To understand why a seed round would balloon to $73 million, you have to understand the scale of the trap businesses are in. There are currently an estimated three trillion PDFs in circulation. "Some people see the PDF more than they see their kids," Gavish jokes.

    The history of the digital document is not a linear progression where one format replaces another. Instead, it’s a story of "speciation," where different formats evolved to fill distinct ecological niches: creation, distribution, and collaboration.

    The era of files: Microsoft Word (1980s-1990s)

    Digital documents began as isolated artifacts. In the 1980s, the "document" was inextricably linked to the hardware that created it. A file created in WordPerfect on a DOS machine was effectively gibberish to a Macintosh user.

    Microsoft Word, which traces its lineage to the pioneering WYSIWYG editors at Xerox PARC, changed this by leveraging the dominance of the Windows operating system. By the 1990s, the binary .doc format had become the default container for editable professional documents. However, these files were structurally complex "memory dumps" designed for the limited hardware of the time, often leading to corruption or to privacy leaks where deleted text remained hidden in the file's binary data.

    The era of digital 'stone': the PDF (1990s-2006)

    The PDF didn’t originate as a tool for writing; it was a tool for viewing. In 1991, Adobe co-founder John Warnock penned the "Camelot Project" white paper, envisioning a "digital envelope" that would look identical on any display or printer.

    Unlike Word files, which were malleable, PDFs were designed to be immutable. They used the PostScript imaging model to place characters at precise coordinates, guaranteeing visual fidelity. While adoption was initially slow, Adobe’s 1994 decision to release the Acrobat Reader for free established PDF as the global standard for "digital concrete": the format of finality used for contracts, government forms, and archives.

    The collaborative cloud docs era (2006-present)

    In 2006, Google disrupted the model again by moving the document from the hard drive to the browser. Using "Operational Transformation" algorithms, Google Docs allowed multiple users to edit the same stream of text simultaneously.
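
    To make the idea of Operational Transformation concrete, here is a toy sketch in Python. It handles only concurrent text insertions, and the names Insert, apply, transform, and the site tie-breaker are illustrative assumptions rather than anything taken from Google Docs. The point it shows is that each editor shifts the other's operation before applying it, so both copies end up identical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    pos: int    # index in the shared base text where the new text goes
    text: str   # text to insert
    site: int   # hypothetical editor id, used only to break position ties


def apply(doc: str, op: Insert) -> str:
    """Apply a single insertion to a document string."""
    return doc[:op.pos] + op.text + doc[op.pos:]


def transform(op: Insert, against: Insert) -> Insert:
    """Rewrite `op` so it is valid after `against` has already been applied."""
    goes_first = against.pos < op.pos or (
        against.pos == op.pos and against.site < op.site
    )
    if goes_first:
        # The other insertion landed before ours, so shift right by its length.
        return Insert(op.pos + len(against.text), op.text, op.site)
    return op


base = "PDF is stone"
a = Insert(0, "The ", site=1)   # editor 1 prepends text
b = Insert(12, ".", site=2)     # editor 2 appends a period at the same time

# Each editor applies its own edit first, then the transformed remote edit.
site_a = apply(apply(base, a), transform(b, a))
site_b = apply(apply(base, b), transform(a, b))
assert site_a == site_b == "The PDF is stone."
```

    A production system also has to transform deletions, formatting changes, and long chains of operations relayed through a server, which this sketch leaves out.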

    This shifted the paradigm from "sending a file" to "sharing a link." Google Workspace now claims over 3 billion users (largely consumers and education), and it has fundamentally changed how we work, turning documents into living, collaborative processes rather than static artifacts.

    The status quo: fragmentation

    Despite these advances, the business world remains fragmented. We draft in Google Docs (the "Digital Stream"), format in Word (the "Digital Clay"), and sign in PDF (the "Digital Stone").

    But this fragmentation has a cost. "The problem is not the document. It’s everything around it," the company notes. "Once a PDF leaves your system, control is gone. Versions drift. Access is unclear. Nothing is visible."

    Turning digital documents into intelligent infrastructure

    Factify’s bet is that in the age of AI, this fragmentation is no longer just annoying; it’s a critical failure. AI models need structured, verifiable data to function.

    When an AI "reads" a PDF, it’s essentially guessing, using optical character recognition to scrape text from what’s effectively a digital photograph.

    "What we're dealing with here is a megalomaniac vision, but it's at the same time probably something that’s inevitable," Gavish says.

    Factify’s solution is to treat documents not as static files, but as intelligent infrastructure. In the "Factified" standard, a document carries its own brain. It possesses a unique identity, a live permission system, and an immutable audit log that travels with it.

    "We wrote a new document format that supplants the PostScript," Gavish explains. "We created a new data layer that supports the document as a first-class citizen… and it's always available inside the organization and potentially outside."

    This distinction, between a File and an API, is the core of the company's pitch:

    • Files are liabilities: They accumulate, get lost, and can be stolen. "It goes back to a brick standing," Gavish says. "Files are liabilities, if anything, because they just accumulate there, you have to guard them."

    • APIs are assets: A Factify document is an active object. You can ask it questions: "Who has seen you? When do you expire? Are you the most up-to-date version?"
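
    As a rough illustration of a document that behaves like an API rather than a file, the Python sketch below gives a document a unique identity, a permission check, and an audit log it can be queried about. Everything here is an assumption made for illustration: the SmartDocument class, its field and method names, and the hash-chained log are hypothetical and do not describe Factify's actual format or interface. The hash chain also only makes tampering detectable; it does not make the log literally immutable.

```python
import hashlib
import json
import uuid
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class SmartDocument:
    """Hypothetical 'active' document; not Factify's real data model."""
    body: str
    expires_at: datetime
    allowed_readers: set[str]
    doc_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    version: int = 1
    audit_log: list[dict] = field(default_factory=list)

    def _append_event(self, actor: str, action: str) -> None:
        # Each entry records the previous entry's hash, so changing or
        # removing an earlier entry makes verify_log() return False.
        prev_hash = self.audit_log[-1]["hash"] if self.audit_log else ""
        entry = {"actor": actor, "action": action, "prev": prev_hash}
        entry["hash"] = hashlib.sha256(
            json.dumps(entry, sort_keys=True).encode()
        ).hexdigest()
        self.audit_log.append(entry)

    def read(self, user: str, now: datetime) -> str:
        """Return the body if the user is allowed and the document is live."""
        if user not in self.allowed_readers or now >= self.expires_at:
            self._append_event(user, "denied")
            raise PermissionError(f"{user} may not read {self.doc_id}")
        self._append_event(user, "viewed")
        return self.body

    def who_has_seen(self) -> list[str]:
        """Answer 'Who has seen you?' from the audit log."""
        return [e["actor"] for e in self.audit_log if e["action"] == "viewed"]

    def is_latest(self, latest_version: int) -> bool:
        """Answer 'Are you the most up-to-date version?'."""
        return self.version == latest_version

    def verify_log(self) -> bool:
        """Recompute the hash chain to detect edits to the log."""
        prev_hash = ""
        for entry in self.audit_log:
            unhashed = {k: v for k, v in entry.items() if k != "hash"}
            expected = hashlib.sha256(
                json.dumps(unhashed, sort_keys=True).encode()
            ).hexdigest()
            if entry["prev"] != prev_hash or entry["hash"] != expected:
                return False
            prev_hash = entry["hash"]
        return True


memo = SmartDocument(
    body="Investment memo",
    expires_at=datetime(2030, 1, 1, tzinfo=timezone.utc),
    allowed_readers={"alice"},
)
now = datetime(2026, 1, 29, tzinfo=timezone.utc)
memo.read("alice", now)
try:
    memo.read("bob", now)          # bob is not on the reader list
except PermissionError:
    pass
print(memo.who_has_seen())         # ['alice']
print(memo.verify_log())           # True
```

    In this sketch the three questions from the pitch map onto who_has_seen(), the expires_at field, and is_latest(); a plain file on disk can answer none of them by itself.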

    'People don't change', but formats do

    History is littered with formats that tried to replace the PDF (like Microsoft’s XPS). They failed because they demanded too much behavioral change from users. Gavish is keenly aware of this trap.

    "When I talk to enterprise software entrepreneurs, I tell them the two laws to know about starting a company in enterprise software is that people don't care, and no one changes," he says.

    To skirt this, Factify has built deep backwards compatibility. A Factified document can look exactly like a PDF, complete with page breaks and margins. Users don't have to learn a new interface to get value; they just need to solve a specific pain point, like an executive who wants to ensure an investment memo can’t be forwarded.

    "All they have to tell their team is, 'Dear Chief of Staff, employment agreements and investment memoranda… are going to be Factified. The rest carry on,'" Gavish says. "They see immediate benefit… but then they discover that they've crossed the Rubicon."

    What's next for Factify?

    The capital from this round will be used to deepen the platform's core engineering, which Gavish describes as a "heavy engineering lift" requiring the company to rebuild the document format, data layer, and application layer from scratch. The company is also establishing a major operational hub in Pittsburgh to support its U.S. expansion.

    Ultimately, Factify isn't trying to build another collaboration tool like Google Docs. It is trying to build the immutable record of the future: the standard for "truth" in a digital world.

    "The PDF… became a standard which means I can’t file my taxes using any other format. That is how victory looks like," Gavish says. "We’re creating a document standard that’s not specific for health care or for insurance, but is just document as such."

    For the three trillion static files currently sitting in cloud storage, the writing may finally be on the wall.
