standard extensions in 1.7.3

April 18, 2008 – 9:48 am by Bill

For 1.8, I’ve been moving towards a model where new functionality is added via “standard extensions”, that come with the sources and are installed in the “site-extensions” directory by default. It’s a model we’d like more feedback on. The standard extensions in the 1.7 release family include several rippers (to automatically extract metadata from NY Times articles, to automatically extract metadata using the CiteULike metadata extraction tools, to automatically extract metadata for articles from the ACM Digital Library site), and two extensions that provide some user interface functionality: the IMAP server and the UploadDocument extension.
[Read more…]

UpLib 1.7.3 available

April 17, 2008 – 5:39 pm by Bill

I’ve sent out the 1.7.3 release to our beta testers. This is the first one which comes with installers (for OS X 10.5 Intel, and for Linux). I hope this will allow more folks to try it out. I’m working on a Windows installer, but having a slow time making progress (I’m not very used to Windows, so I have to spend an inordinate amount of time double-checking what I do.)

Leptonica update - 1.54 bad

February 26, 2008 – 7:21 pm by Bill

I see that the findimages program won’t build if you use a recent version of Leptonica (version > 1.50). I’ve fixed this in the upcoming beta 3, but there’s still a problem with Leptonica 1.54 (it unwisely uses the fmemopen() function without testing to see if the platform supports it), so use 1.53 or 1.55 instead. I’m still using 1.46.

script

February 20, 2008 – 12:36 pm by Jaap

Hallo UpLib,

In <osx-install-prereqs.csh>  Terminal, bash, finds ‘(’ and  ’ )’  just after:

foreach file ( ${prereqdir}/*.tar.gz ${prereqdir}/*.tgz )  [here]
tar xvfz $file
end
foreach file ( ${prereqdir}/*.tar.bz2 )  
[and here]
bunzip2 < $file | tar xvf -

and does not like it. I did remove them. But  I have the feeling install is not complete. (script does go on and on for some time) How can one see  install is complete? I expect Uplib somewhere. (working on MAC, OS 10.4.11)

Jaap

Adding HTML docs to UpLib [updated]

February 13, 2008 – 11:36 pm by Bill

I’m frequently asked, “why can’t I add an HTML file to UpLib?” The answer is, basically, that UpLib deals in documents, and you can add an HTML file if it’s part of a document. The problem with HTML as a document format is that it was designed to live behind a Web server, and be pulled from that Web server. That Web server provides a context for the HTML, which can be used to resolve relative references to images, scripts, applets, and style sheets referenced from the HTML. If you take the Web server away, resolving the references becomes problematic, and the probability that you will only get part of the document becomes much higher.

A number of solutions to this problem have been devised.
[Read more…]

Getting started on UNIX (or OS X)

January 8, 2008 – 2:56 pm by Bill

If you are running OS X, or Linux, or Solaris, or other various UNIX-ish systems, setting up the prerequisites can be boring. I’ve written a couple of csh scripts to do the builds and patching and installs of everything, and put them up as “the script for OS X” and “the script for Linux”. Take a look at the first few lines to see what the arguments should be.

Getting started on Windows

December 20, 2007 – 9:30 pm by Bill

It’s hard to build UpLib from scratch on Windows, because (a) it’s hard to make software work on Windows, and (b) because there are lots of prerequisite packages which need to be installed, which makes it much harder.

So, here’s a recipe.
[Read more…]

Hi!

December 6, 2007 – 10:34 am by Bill

Hi! This is the blog for the UpLib project. Feel free to create an account and post something! We’re using the same account database for both the blog and the bug tracker, so after you’ve created a blog account, you can also use that same username & password to log into the bug tracker.

As an experiment, we’re going without a mailing list for UpLib, just having the blog.