
Need help with getting text data from web site.


This topic is 5546 days old. Please don't post here. Open a new topic instead.


Posted

This is my first post in this community. I have a problem getting text data from a web site.

If it's an ordinary HTML page, you can use GetLayoutObjectAttribute ( "web viewer" ; "content" ) to get all the HTML of the page and parse it.

But if I do the same thing to get text data from a URL like http://hoge.com/test.txt, unwanted text (such as an HTML header) comes with it.

I found the plug-in "Troi URL", with which I can get only the text I want.

But if there is a way to do this without a plug-in, that would be awesome, because I want to make an exe file with the Developer Utilities.

(I guess that the exe file doesn't support plug-ins.)

Sorry for my poor English.

If you have any ideas, please help me.

Posted (edited)

A search of this Topic Area will provide a few Threads on this Subject. Here is one that has a couple of demonstration files that helped me get started. Link

BTW, the first one worked for me better than the modified one.

HTH

Lee

Posted

Hi, Lee! Thanks for the reply =)

I took a look at the link you posted, but that was not what I was talking about.

The thread at your link is about how to get rid of all the tags and turn the page into plain text.

What I meant was how to get an actual .txt file from a web site.

http://yutori7.2ch.net/ogame2/dat/1253872803.dat

If you open this URL, each comment (record) is on its own line, and the fields are separated by "<>".

*Sorry, it's all written in Japanese.

I want the text exactly as it appears when you open that dat file in Notepad (the Windows app).

But if I try to get its text with a web viewer, an unwanted header comes with it every time, like:

<?xml version="1.0" encoding="Shift_JIS"?>

When you get data from a web viewer, it somehow always adds this unwanted part.

I wonder if there is any way to solve this without a plug-in?
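
For illustration only, the layout described above (one comment per line, fields separated by "<>") can be sketched outside FileMaker. This Python is a minimal sketch and not a FileMaker solution. The function name `parse_dat` and the sample text are hypothetical, and it assumes the only unwanted part is a single leading XML declaration like the one quoted above.

```python
import re

# Assumption: the only unwanted prefix is one leading XML declaration,
# such as <?xml version="1.0" encoding="Shift_JIS"?>.
XML_DECLARATION = re.compile(r'^\s*<\?xml[^>]*\?>\s*')


def parse_dat(text):
    """Split dat-style text into records (one per line) and fields ("<>")."""
    # Drop the leading declaration if it is present.
    body = XML_DECLARATION.sub("", text, count=1)
    records = []
    for line in body.splitlines():
        if line.strip():  # skip blank lines
            records.append(line.split("<>"))
    return records


# Hypothetical sample data, not taken from the real file.
sample = (
    '<?xml version="1.0" encoding="Shift_JIS"?>\n'
    "name1<>mail1<>date1<>comment1\n"
    "name2<>mail2<>date2<>comment2\n"
)
records = parse_dat(sample)
```

In FileMaker terms, the same idea would roughly be to cut everything up to the end of the declaration and then split on the line breaks and on "<>".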

Posted

I don't think that there is a plugin that will do what you describe.

Before the Web Viewer, I would paste the web page into BBEdit or TextWrangler, remove the unwanted information using grep patterns and an AppleScript, and then import the data into FileMaker.

Using the Web Viewer, I have adapted the file I pointed to, and I use a combination of custom functions that can be found at fmfunctions and Brian Dunning's site, along with the functions you would have used in the past, such as Right, Left, and Substitute, plus ScriptMaker scripts, to parse the data.
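
The grep-style cleanup can be sketched roughly as follows. This is only an illustration in Python, not the actual BBEdit patterns or custom functions. The function name `strip_markup` and the sample wrapper markup are hypothetical.

```python
import html
import re

# A tag is "<", one or more characters that are not angle brackets, then ">".
# Using "+" rather than "*" leaves an empty "<>" field separator untouched.
TAG = re.compile(r"<[^<>]+>")


def strip_markup(page):
    """Remove tags, then decode entities such as &amp;."""
    without_tags = TAG.sub("", page)
    return html.unescape(without_tags).strip()


# Hypothetical wrapper markup around the wanted text.
sample = "<html><body><pre>a<>b &amp; c</pre></body></html>"
cleaned = strip_markup(sample)
```

The same pattern could be approximated in FileMaker with Position, Left, Right, and Substitute, at the cost of more steps.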

HTH

Lee
