Jump to content
Chinese-Forums
  • Sign Up

Introducing Chinese Text Analyser


Recommended Posts

Posted

It's a known (to me) bug when there is a single line document without a newline at the end.

It's a rare enough use case that I haven't fixed it yet.

Thanks for reporting it though.

Posted

Try this:  Select several words on a single line.  Copy and paste in to a new CTA document.  Hover the mouse over the last word in the line.

 

You'll see the same thing.  The mouse hover/click currently miscalculates the last thing in the document.  This is most often an 'end of line' character so it's not a problem.  Occasionally there are documents that don't end with an end-of-line marker, and in those cases CTA won't highlight/enable clicks for the last word of those documents.

Posted

 

The mouse hover/click currently miscalculates the last thing in the document.

 

Interesting. Not sure if this is at all related, but I've noticed a couple of times that sometimes it seems like CTA thinks the words are higher/lower than they actually are. So instead of being able to select with the tip of the mouse pointer, it's more like the middle of the mouse pointer. I think in those instances I just had to close CTA and start it up again and everything was fine (maybe even just open a new document without closing CTA - can't remember). Anyways, it doesn't happen often at all, but I thought I'd mention it...

Posted

If it happens again, send me a copy of the document that caused it and if possible, reproduction steps.

Posted
in those cases CTA won't highlight/enable clicks for the last word of those documents.

 

Okay, got that.

 

But the thing with 夜猫 isn't just the inability to highlight/enable clicks: it's the stats on the right

 

 

Paste 野猫 (not the magic 夜猫!)

 

- as you say, can't click the word

- word statistics: Total: 1; known 1; % known: 100%; unknown 0

 

Paste 战猫 (not a word but two characters)

 

- as you say, can't click the last character

- word statistics: Total: 2; known 2; % known: 100%; unknown 0

 

Paste 夜猫

 

- as you say, can't click the word

- word statistics: Total: 0; known 0; % known: 100%; unknown 0

 

Edit: obviously it's no big deal! I was partly curious though because while reproducing the error, CTA froze on my twice and had to be restarted, and I couldn't remember the programme doing that to me before.

Posted
word statistics: Total: 0; known 0; % known: 100%; unknown 0

That is a problem.  From a correctness point of view, missing words are not good.

 

CTA freezing on you is also not good.  If you can provide more specific reproduction steps that would be useful.

Posted

Would it be possible to for me to replace the HSK information with TOCFL? I would prefer to have words getting their TOCFL level specified, which I use as a reference. And maybe TOCFL statics as well?

  • 5 weeks later...
Posted

I tried downloading it twice but the Mac version gets a "image not recognized" warning. I don't know how to get passed this.

Posted

Imron,

 

Awesome update! When I received an email announcing your new post, I was actually reading a novel in CTA! After 2+ years, I still use the program almost every day :)
 
The online dictionaries are a great new feature. I'm surprised, though, that you didn't include 百度词典 as a default dictionary. For me (in Canada), Baidu is much faster than ZDIC, MDBG, and LINE. And for users in China, Baidu would be much, much faster!
 
For anyone interested, the Baidu dictionary path is:

[baidu]
   order = 50

 

 
Incidentally, you can't use Chinese characters in the dictionary name. I am not sure if that is intended behavior. I tried putting 百度词典 as the dictionary name and it displayed as gobbledygook in CTA.
 
Thank you Imron for including documentation with this release. I read through it. Really well done. I did notice a tiny formatting issue. On the Online Lookup web page...
 
file:///C:/Program%20Files/ChineseTextAnalyser/docs/online-lookup.html
 
...the file paths don't look right. They appear without slashes or spaces:

C:Users<username>AppDataLocalChineseTextAnalyserdataonline-search
C:Program FilesChineseTextAnalyserdataonline-search

 

Since I'm posting, I may as well make a couple feature requests :mrgreen:
 
My #1 coveted feature is the ability to add dictionary definitions in the program itself. Currently this is a real pain. Let's say I have added 50 custom words in a month, and I want to add dictionary definitions for them. Here are the steps:
 
  1. Open words.u8
  2. Look up the 50 custom words in a dictionary
  3. Copy or write the definitions one by one into a text editor
  4. Double-check that each entry is in CEDICT format
  5. Open cedict_ts.u8
  6. Copy and paste my new entries
This is more work than it sounds, especially with lots of entries. It would be much more convenient to input definitions on the fly—i.e., fill out a pop-up form in CTA—whenever I add a custom word.
 
Here's a minor point. The issue might be with me  :lol: but I'll share it because it has tripped me up a few times.
 
Currently there are two different messages when closing an unsaved text. The first appears when you click [X] to close an open tab with unsaved text in it. The second appears when you close CTA and a tab with unsaved text in it is currently open. These are the messages:
 
  1. The contents of this document were copied from the clipboard. Would you like to save to a file before closing? [Default: Yes]
  2. You currently have unsaved documents open. Are you sure you wish to exit? [Default: Yes]
This catches my brain every time. So much so that I must think twice whenever I close a tab or the program. :wall I think it is because the two default "Yes" options do opposite things. One saves and closes; the other closes without saving. I am probably also used to Microsoft Word behavior, where attempting to close the program with unsaved work will ask if you want to save it, rather than confirm you want to exit without saving.
 
One final small thing. I love that CTA automatically bookmarks your location in a document. But it only does this with texts that are already saved. If you copy a text into CTA, read for a while, then close and save it, CTA will not remember your location.
 
This can be a pain if, say, you paste a novel into CTA and immediately start reading it. When you're done, you click [X] and save your document. CTA will not remember your place in this instance.
 
Imron, thank you for continuing to improve CTA! I hope you sell a million copies :) 
  • Like 2
Posted

I tried downloading it twice but the Mac version gets a "image not recognized" warning. I don't know how to get passed this.

It sounds to me like you have a corrupted download - twice, what are the chances right?

 

But I just downloaded it again now for Mac on Sierra and it worked fine.  One thing you can do to check is this.

 

First open up Terminal.

 

Then assuming you saved the file in ~/Downloads, type 

md5 ~/Downloads/chinese-text-analyser-0.99.16-install.dmg
The output should be something like:

 

MD5 (/Users/<your username>/Downloads/chinese-text-analyser-0.99.16-install.dmg) = 3ac6444404c314af83850dd9ad3eda73

 

The important part is the the number at the end.  It should be 3ac6444404c314af83850dd9ad3eda73

 

If the number is not the same, then the download has failed somehow.

@murrayjames, I'll write a longer response in a bit but just wanted to say

 

I hope you sell a million copies

Me too! Sales will have to pick up quite a bit for that to happen though :mrgreen:
Posted

I'm surprised, though, that you didn't include 百度词典 as a default dictionary

 

It slipped my mind.  Those others were the first three I could think of.  I'll put in in the next release.

 

 

Incidentally, you can't use Chinese characters in the dictionary name.

 

You can, you just need to make sure you save the file as UTF-8.  See attached pic. post-462-0-26200500-1483706627_thumb.png

 

...the file paths don't look right. They appear without slashes or spaces:

Good catch.  This will be fixed in the next release.

 

My #1 coveted feature is the ability to add dictionary definitions in the program itself.

The squeaky wheel gets the grease.  This has been on my todo list for ages.  I'll bump it up a few notches.

 

This catches my brain every time. So much so that I must think twice whenever I close a tab or the program. :wall I think it is because the two default "Yes" options do opposite things.

Excellent point.  I'll look to change this in the next version.

 

But it only does this with texts that are already saved. If you copy a text into CTA, read for a while, then close and save it, CTA will not remember your location.

Sounds like a bug.  It should be an easy fix.

 

Thanks for the excellent feedback.  Be sure to let me know if there's anything else you think could improve things.

  • New Members
Posted

a wild guess..do you change your IE page into Big-5under the view--->Encoding--->Chinese traditional Big-5 before you type your chinese?

Posted

Imron,

 

Thanks for your quick reply!

 

One thing I forgot to mention: Thank you for adding the dictionary lookup shortcut (Shift + double-click). With that and the toggle words known/unknown shortcut (double-click), I'm finding it even easier to move through a document. Who knew that right-clicking and selecting from a drop-down menu could take up so much time?  :lol:

 

 

a wild guess..do you change your IE page into Big-5under the view--->Encoding--->Chinese traditional Big-5 before you type your chinese? 

 

JamesLam, I'm not sure if that question is for me. I fixed it by following Imron's advice above: re-saving the "online-search" file in UTF-8 format. (A weird aside: I had to do this in Notepad. I tried saving in UTF-8 format in Notepad++, but for some reason it did not work.) 

Join the conversation

You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Unfortunately, your content contains terms that we do not allow. Please edit your content to remove the highlighted words below.
Click here to reply. Select text to quote.

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...