Data Science

Bytecode is used in Java, Python, and other languages
However, not all bytecode is created equal
Bytecode is a generic term for an intermediate language used by compilers and interpreters
For example, Java bytecode contains information about primitive data types
On the other hand, Python bytecode does not contain this information
As a result, the Python virtual machine (PVM) is slower than the Java virtual machine (JVM)
Specifically, the bytecode in the PVM takes longer to execute than the bytecode in the JVM

During run-time, the bytecode is interpreted by a JVM interpreter within the JVM
Before interpretation, a JIT compiler compiles the bytecode into machine code within the JVM
Unlike Python, Java is able to do this because Java is statically-typed
Therefore, type checking has already happened during compile-time
Returning to the illustration of an interpreter, the JVM interpreter looks like the following:
```
bytecode, input --> | JVM interpreter | --> output
```
Since the data types of input are known, a JIT compiler can be used at run-time
Therefore, the JVM compilation process looks like the following:
```
bytecode --> | JIT compiler | --> machine code
```
With the addition of the JIT compiler at run-time, the JVM interpreter now looks like this:
```
machine code, input --> | JVM interpreter | --> output
```
Bytecode is platform dependent
Machine code is platform dependent
Specifically, there is different machine code for different processors
This is why the JIT compiler exists within the JVM and can't be compiled beforehand

At compile time:
- Language syntax is checked
- Data types are checked for statically-typed languages
At run time:
- Computations such as addition, division, etc.
- Data types are checked for dynamically-typed languages

Generally, a dynamically-typed language executes many common programming behaviors at runtime
A statically-typed language is able to execute these behaviors at compile time
This is because statically-typed languages give the compiler much more information (e.g. variable types, etc.)
Specifically, the compiler has information about the structure of the program and its data
With this information, the compiler will be able to optimize both memory access and computations
As a result, statically-typed languages are generally faster than dynamically-typed languages

Essentially, the bytecode of a statically-typed language will run faster compared to the bytecode of a dynamically-typed language
This is because bytecode of statically-typed languages still need to determine information like variable types
- Statically-typed languages have already done this
- Dynamically-typed languages need to do this because a user could pass a variable as a list, integer, etc. at runtime
To effectively compile dynamically-typed languages:
1. Enforce a static structure of data
2. Infer the types of all variables, classes, etc.
A compiler of a dynamically-typed language could enforce the above conditions
However, implementing these additional checks and inferences leads to larger bytecode
Meaning, running the bytecode becomes slower

Parallelism

The GIL