You are viewing limited content. For full access, please sign in.

Question

Question

How to return total "content" count in whole repository with workflow

asked on February 16, 2016 Show version history

Hi all,

 

I know I can manually see the total document count in properties of the root folder as an admin:

ie:

Child folders: 14,212

Child documents: 51,740

I also know I can retrieve a total content count using the search syntax:

{LF:LOOKIN="ReposRoot\"}

That returns: 65,953

Which evidently includes all the folders and child documents (and the root folder itself) (14,212+51,740=65,952)

Furthermore I know I can return a total count of OCRable documents (ie those with images/pages >0) with:

{LF:pagecount > 0} & {LF:LOOKIN="ReposRoot\"}

That returns: 31,140

Demonstrating that only about half the repository is imaged entries amenable to OCRing.

 

What I want to be able to do is run workflow that produces some simple metrics:

 

Total Content Count (excluding folders; ie Child Documents in the context menu properties sense)

??

Total OCRable Documents:

{LF:pagecount > 0} & {LF:LOOKIN="ReposRoot\"}

Total OCR=ALL, OCR=Some and OCR=None counts:

{LF:AssociatedPages="Y"} & {LF:OCR=all/some/none})&{LF:pagecount > 0} & {LF:LOOKIN="ReposRoot\"}

After putting all those in tokens I can do the math and send an email.

So...

How to automate the calculation of the "Child Documents" figure with workflow?

 

Best,

Will

0 0

Answer

SELECTED ANSWER
replied on February 18, 2016 Show version history

No, type D is document. Folders would be "F".

1 0

Replies

replied on February 17, 2016

You can use another "Search Repository" activity in Workflow with the following syntax

{LF:Name="*", Type="D"}

and then use the "Result Count" token for that activity.

1 0
replied on February 17, 2016

I'd just get the values directly from the SQL database. The search will work, but you would only get the count of the entries the connection profile user has rights to see. And since you're not actually doing anything with the entries, you're just making the server calculate security for nothing, so the search will get slower and consume more resources the bigger the repository gets.

The TOC table has a list of all the entries, the DOC table has a list of all the image and text pages.

  select count(*) from toc where etype = -2

select count(*)
  from [doc]
  where img_size is not null and txt_size =0

 

1 0
replied on February 17, 2016

Type D is directory count that I can sub from all?

Both answers most informative and helpful. Thank you. 

W

0 0
SELECTED ANSWER
replied on February 18, 2016 Show version history

No, type D is document. Folders would be "F".

1 0
replied on February 18, 2016 Show version history

Thank you Miruna!

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.