Tamura

Tamura

Shinichi Tamura is currently working at Morooka Orthopedic Hospital at Seimeikai Medical Corporation. He attended Kyushu Electric Technology vocational college from 1989 – 1991, with an interest in focusing on AV equipment. Shinichi matriculated from the College of Electrical Engineering. After graduating, Shinichi went to work for Victor Service & Engineering at JVC KENWOOD Holdings, INC. There his primary duties included IT/Network systems design, Field management of construction sites, AV equipment and Facilities design, and JVC product repair. After a few months, Shinichi began working as the Secretary General and Systems Engineer for Morooka Orthopedic Hosptial.*

  • Tamura, S., Kawai, H., and Mitsumoto, H. (1993) Sex identification from 8 x 8 low resolution face images by neural network. Med. Imag. Tech. vol.11, (no.3):295-6.
  • Tamura, S., Taketani, H., Mitsumoto, H., Okazaki, H., and others (1992) Detection and normalization of pattern position by neural network. in RNNS/IEEE Symposium on Neuroinformatics and Neurocomputers (Cat. NO.92TH0483-8). New York, NY, USA: IEEE, p. 959-77 vol.2 of 2 vol. xxi+1270 pp.The authors propose a method of detecting and normalizing pattern position by the backpropagation network (BN) with three layers. First, they clarify the troublesome situation of restoring the shifted position by BPN experimentally, using a one-dimensional signal. Next, it is shown that by cascading two networks, one can detect and normalize the signal position and demonstrate its usefulness. This method is applied to a two-dimensional image, and its validity is confirmed.
  • Kim, E.-K., Wu, J.-T., Tamura, S., Sato, Y., Close, R. Taketani, H., Kawai, H., Inoue, M., and Ono, K. (1993) Comparison of neural network and k-NN classification methods in vowel and patellar subluxation image recognitions. International Journal of Pattern Recognition and Artificial Intelligence, vol.7, (no.4):775-82.We make a comparison of classification ability between BPN (BackPropagation Neural Network) and k-NN (k-Nearest Neighbor) classification methods. Voice data and patellar subluxation images are used. The result was that the average recognition rate of BPN was 9.2 percent higher than that of the k-NN classification method. Although k-NN classification is simple in theory, classification time was fairly long. Therefore, it seems that real time recognition is difficult. On the other hand, the BPN method has a long learning time but a very short recognition time. Especially if the number of dimensions of the samples is large, it can be said that BPN is better than k-NN in classification ability.
  • Taketani, H., Mitsumoto, H., Tamura, S., Okazaki, K., and others (1992) Detection and normalization of pattern location by neural network. Transactions of the Institute of Electronics, Information and Communication Engineers D-II, vol.J75D-II, (no.7):1260-70 (Japanese).The authors describe a method of detecting and normalizing pattern location by back propagation network (BPN) with three layers. First, the authors clarify the troublesome situation of compensating the shifted position by BPN experimentally, using one dimensional signals. Next, they propose a new method of solving this problem by cascading two networks where the signal location is detected and normalized, and they demonstrate its usefulness. They show the weight distribution has a characteristic of Fourier-series expansion. Finally they apply the method to 2-dimensional image. They compare three methods. As a result, the 2-dimensional location signal method that expands pattern and location signals into 2-dimensional ones is best from the view point of convergence in learning phase. The method of cascading two one-dimensional networks which normalize pattern first in horizontal direction by simply binding one-dimensional networks and then in vertical direction by the same method is considered. Although the correct answer rate of position normalization is not as good as the others, it does not need learning in 2-dimensional space and its normalization processing is fastest.
  • Kawai, H. and Tamura, S. (1992) Man-and-woman and individual classifications by mosaic facial images with different resolutions using neural network. Journal of the Institute of Television Engineers of Japan, vol.46, (no.1):93-6 (Japanese).Intensive studies have been made on individual classifications by facial images and analysis of facial expressions, etc. using neural networks from the viewpoint of learning. The authors carried out experimental classifications by 8*8, 16*16 and 32*32 mosaic facial images. As a result, man-and-woman classifications can be made with probability of 87% from 8*8 unknown images which are very difficult to analyze.
  • Furuya, T., Soeda, M., Kurosu, K., and Tamura, S. (1991) Speech recognition with lip movement data using an X-Y tracker. Transactions of the Society of Instrument and Control Engineers, vol.27, (no.8):958-65 (Japanese).Automatic syllable recognition is adversely affected by noises and similar syllables, Lip-reading may improve recognition. The authors propose such a system, for a finite number of words, using a microphone and an X-Y tracker. The tracker performs initial picture processing and yields co-ordinate pairs. These are processed together with sounds. The weight coefficients to change the importance of visual or voice data are introduced and selected to get the best performance for some registered words. The experiments were carried out to prove that the proposed system can improve the recognition rate in the presence of continuous noises. The results show improvements of about 10% and about 20% for SNRs of 35 dB and 26 dB respectively.
  • Wu, J.-T., Tamura, S., Mitsumoto, H., Kawai, H., Kurosu, K., and Okazaki, K. (1991) Neural network vowel-recognition jointly using voice features and mouth shape image. Pattern Recognition, vol.24, (no.10):921-7.Describes a neural approach intended to improve the performance of an automatic speech recognition system for unrestricted speakers by using not only voice sound features but also image features of the mouth shape. In particular, the authors used the natural sample voice signals and mouth shape images that were acquired in the general environment, neither in the sound isolation room nor under specific lighting conditions. The FFT power spectrum of acoustic speech was used as the voice feature. In addition, the gray level image, binary image and geometrical shape features of the mouth were used as the compensatory information, and compared to find which kinds of image features were effective. This method can be applied not only to the improvement of voice recognition, but also to aid the communication of hearing-impaired people.
  • Wu, J.-T., Tamura, S., Mitsumoto, H., Kawai, H., and others (1991) Speaker-independent vowel recognition combining voice features and mouth shape image with neural network. Systems and Computers in Japan, vol.22, (no.4):100-9.The paper describes a neural approach intended to improve the performance of a voice recognition system for unrestricted speakers using not only voice sound features but also image features of the mouth shape. The FFT power spectrum of acoustic speech was used as the voice feature. In addition, the gray-level image, binary image, and geometrical shape features of the mouth were used as the compensatory information and a comparison made of which kinds of image features are effective for voice recognition by a neural network. For unrestricted speakers, a vowel recognition rate of about 80 percent was obtained using only voice features. However, this increased to some 92 percent when voice features plus binary images were used. This method can be applied not only to the improvement of voice recognition, but also to aid the communication of hearing-impaired people.
  • Wu, J.-T., Tamura, S., Mitsumoto, H., Kawai, H., Kurosu, K., and Okazaki, K. (1990) Neural network vowel-recognition jointly using voice features and mouth shape image. Transactions of the Institute of Electronics, Information and Communication Engineers D-II, vol.J73D-II, (no.8):1309-14 (Japanese).This paper describes a neural approach intended to improve the performance of a voice recognition device by using not only voice sound features but also image features of the mouth shape. The FFT power spectrum was used as the voice feature. In addition, the gray level image, binary image, and geometrical shape features of the mouth was tested for comparison to check which kinds of features are effective for voice recognition by a neural network. For unrestricted speakers, a vowel recognition rate of about 80% was obtained using voice only features, but this increased to some 92% when voice features plus binary images were used. This method can be applied not only to the improvement of the voice recognition, but also to aid the communication of hearing impaired people.
  • Mitsumoto, H., Okazaki, K., Okazaki, K., Kajimi, N., Tamura, S., Kawai, H., and Fukui, Y. (1990) Lip contour extraction, complement, and tracing by using energy function and optical flow. Transactions of the Information Processing Society of Japan, vol.31, (no.3):444-53 (Japanese).The article mentions: lip contour extraction by zero-crossing method, lip contour tracking.
  • Tamura, S., Kawasaki, S. (1988) Recognition of sign language motion images. Pattern Recognition vol.21, (no.4): 343-353.

Comments are closed.