Click here to monitor SSC
SQLServerCentral is supported by Redgate
 
Log in  ::  Register  ::  Not logged in
 
 
 
        
Home       Members    Calendar    Who's On


Add to briefcase

Word Doc in SQL Server! Expand / Collapse
Author
Message
Posted Tuesday, May 14, 2013 8:55 AM
SSC-Addicted

SSC-AddictedSSC-AddictedSSC-AddictedSSC-AddictedSSC-AddictedSSC-AddictedSSC-AddictedSSC-Addicted

Group: General Forum Members
Last Login: Monday, April 25, 2016 2:14 AM
Points: 403, Visits: 1,038

Hi All
I wish to store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


-----Table Proc Index Performance TSQL &&%$#@*(#@$%.......------------
Deep Into SQL Jungle
Post #1452650
Posted Tuesday, May 14, 2013 9:02 AM


SSChampion

SSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampion

Group: General Forum Members
Last Login: Today @ 9:44 AM
Points: 14,189, Visits: 37,099
S_Kumar_S (5/14/2013)

Hi All
I wish to store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


you'll want to use Full text indexing, which has the option to scan various types of documents: you definitely don't reinvent the wheel when someone has built a race car for the same issue.

as far as formatting goes, the formatting in the document is still in place, but it's broken up into search words;
not sure what it is you are after as far as formatting goes.

http://msdn.microsoft.com/en-us/library/ms142571.aspx


Filters. Some data types require filtering before the data in a document can be full-text indexed, including data in varbinary, varbinary(max), image, or xml columns. The filter used for a given document depends on its document type. For example, different filters are used for Microsoft Word (.doc) documents, Microsoft Excel (.xls) documents, and XML (.xml) documents. Then the filter extracts chunks of text from the document, removing embedded formatting and retaining the text and, potentially, information about the position of the text. The result is a stream of textual information. For more information, see Configure and Manage Filters for Search.


Lowell

--
help us help you! If you post a question, make sure you include a CREATE TABLE... statement and INSERT INTO... statement into that table to give the volunteers here representative data. with your description of the problem, we can provide a tested, verifiable solution to your question! asking the question the right way gets you a tested answer the fastest way possible!
Post #1452652
Posted Wednesday, May 14, 2014 1:32 AM
Grasshopper

GrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopper

Group: General Forum Members
Last Login: Wednesday, May 21, 2014 3:51 AM
Points: 10, Visits: 31
S_Kumar_S (5/14/2013)

Hi All
I wish to
store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from Word DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


You want to make the search key words contain basic formatting? like this,

http://msdn.microsoft.com/en-us/library/office/aa211953%28v=office.11%29.aspx
Post #1570665
« Prev Topic | Next Topic »

Add to briefcase

Permissions Expand / Collapse